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Preface 


This book is about the numbers yy, 7, e, and 2 and the connections among 
them. My goals are to illustrate to students of calculus how the techniques 
that they learned can be used to discover intriguing and useful results, and 
to compile many of the most elementary proofs related to these numbers 
into a resource for teachers. 

Reading the entire book requires knowledge of integration techniques 
(especially integration by parts), but I try to provide plenty of algebraic 
and geometric assistance so that an undergraduate student of calculus can 
read the book solo. With this in mind, I provide endnotes throughout, in 
case the reader is unable to work out some of the missing steps. They 
appear in the final section, which is called Extra Help. 

Each chapter concludes with a series of exercises, all of which intro- 
duce new historical figures or content. The exercises are neither simply 
routine practice, nor are they intended to be overly puzzling. I outline the 
more challenging exercises in the form of guided tours with the hope that 
any motivated student who has read the text carefully will move fluidly 
through them. 

In short, my hope is that this book will help students see how calculus 
leads to remarkable and elegant truths. 


My thanks to the members of the Spectrum Board, led by Jim Tatter- 
sall and Jerry Alexanderson. While I was writing, Jim Tattersall worked 
through everything that I sent to him, “pencil in hand”, strengthening my 
resolve. I am deeply grateful to him. 

My thanks as well to hard-working and highly organized Beverly 
Ruedi and Carol Baxter at the MAA, and to Woody Dudley, one of my 
writing heroes, who thoroughly copy edited the final draft. 

In 2004, Richard Jacobson, chair of the Houghton College mathemat- 
ics department, approved my special topics course that proved to be the 


origin point of this book. Jake then approved two more goes in the ensu- 
ing two years. He trusted me so benevolently. I am not only thankful to 
him, but to the dozens of Houghton students who enrolled in the course. 
Without them, this dream would have died in its infancy. 

About a decade later, Jay Stine, chair of the mathematics department 
at Misericordia University, allowed me the flexibility to teach a special 
topics course during which we read through my first draft. Like Jake, 
he trusted me, but with far less history to base his trust upon. And as at 
Houghton, I am not only grateful to Jay, but also to the ten students who 
took the course: Kyle Kogoy, Nicole Hunsinger, Alec Garrity, Malachi 
Treaster, Austin Galiardo, David Rudalavage, Melissa Bostjancic, Robert 
Knipper, and finally Lydia Watkins, who not only read through the doc- 
ument line by line, spending hours and hours talking to me about the 
details, but who also gave me a gigantic chocolate chip cookie to cele- 
brate the book’s acceptance by the Spectrum Board. 

I owe a great debt to Misericordia students Mike Gottstein, Taylor 
Rupp, and Craig Siekierka, three students who were funded by the uni- 
versity to read through my second draft over the summer. In her dou- 
ble role as writing major and mathematics major, Taylor was particu- 
larly helpful, and I enjoyed talking to her about the proper use of com- 
mas. Hamilton College students Chenchen Zhao, Alexander Black, and 
Benjamin Oltsik also contributed when I asked for help, proofreading 
and writing programs and finding alternate solutions. My friend Patrick 
Barringer read my entire first draft, and gave me several key ideas that 
ultimately made this book much better. My thanks to all of you. 


I dedicate this book to Michelle LaBarre, my spouse of eight years, who 
chuckled when I said that I was going to describe her as “my constant”. 
It was a pleasant chuckle, but still, I won’t pluck this low-hanging fruit. 
Instead, I will tell you what she often tells her riders as she trains them in 
the art of dressage: if the horse and rider are not generating energy, then 
there is no way to steer. I find that this is true in teaching and learning 
as well. I hope above all that this book brings energy to each one of its 
readers. Thank you, Michelle, for being an ever-present source of energy 
to me. 
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CHAPTER 1 


Co) 


Timaeus (a character invented by the philosopher Plato) spoke to Socrates 
at great length in pursuit of the answer to, “Of what is everything made?” 
Two thousand years later, we still struggle with this question. At times, 
there seems to be no limit to what “everything” is, as we split the smallest 
particles into yet smaller. 

Lacking a microscope, Timaeus appealed to the elegance of propor- 
tion and the evidence of his senses. “Separated from fire nothing would 
ever become visible,” he began, “nor would it become touchable without 
something solid, or solid without earth.” Having posited the fundamental 
nature of fire and earth, Timaeus immediately linked the two via mathe- 
matics. “It’s not possible for two things alone to be beautifully combined 
apart from some third: some bond must get in the middle and bring them 
both together. And the most beautiful of these bonds is that which, as 
much as possible, makes itself and the things bound together one, and 
proportion is suited by nature to accomplish this most beautifully.” 

In fact, Timaeus argued that fire and earth should be bound not by just 
one other element but by two, air and water, and thus by two proportions: 


fire air air water 


: and = : 
air water water earth 


These mathematical relationships, Timaeus suggested, brought harmony 
to the four building blocks of creation. 

Today we see that Timaeus was onto something; the stuff of which 
everything is made does manifest harmonic mathematical unity. One fea- 
ture of the unity is the appearance, in multiple settings, of certain num- 


bers that we call constants. We will discover our first constant, the golden 
ratio yy, by pursuing Timaeus’s line of reasoning. 


1.1. Of what is everything made? 


Timaeus’s link between the four basic elements and proportions prompted 
him to search for geometric solids that could represent the elements. He 
settled on four solids that can be built from exact duplicates of regular 
polygons so that all faces meet at a uniformly constant angle. The tetra- 
hedron’s four faces are equilateral triangles, and the faces of the cube 
are squares (Figure 1.1). Timaeus assigned these solids to fire and earth, 
respectively. 


Figure 1.1 


Earth gives rise to air in a simple but ingenious way. Locate the center 
point of each face of the cube, and connect each pair of neighboring 
centers with an edge, as in Figure 1.2. The octahedron thus formed has 


Figure 1.2 
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six corners, twelve edges, and eight triangular faces. (If we carry out 
this same process inside the octahedron, a new cube is born, and so on, 
forever.) Timaeus associated the octahedron with air. 

Fire’s tetrahedron spawns a smaller version of itself using this tech- 
nique, so Timaeus was forced to search from scratch for a solid to rep- 
resent water. The only other solid with triangular faces that meet at a 
uniformly constant angle has twenty faces. Called the icosahedron, this 
nearly spherical solid inspired Timaeus to associate it with a droplet of 
water. (Skip ahead to Figure 1.5 for a glimpse.) 

Connecting the center of each face of the icosahedron with its five 
neighbors creates the last solid that has the properties of the other four, 
the dodecahedron, a twelve-sided solid with pentagonal faces (Figure 
1.3). As with the cube-octahedron pair, the dodecahedron and icosahe- 
dron nest inside one another in an infinite embrace. 

Having already linked the elements fire, earth, air, and water to the 
tetrahedron, cube, octahedron, and icosahedron, Timaeus assigned the 
dodecahedron to represent the entire cosmos. Such a metaphorically im- 
portant shape surely merits closer investigation. 


Figure 1.3 


If we connect the centers of adjacent pentagonal faces of the dodeca- 
hedron, we create one edge of the icosahedron within. But if we instead 
connect a pair of nonadjacent comers A and B of a pentagonal face, as in 
Figure 1.3, then we create one edge of what will be a different solid con- 
tained within the dodecahedron. Continue connecting nonadjacent cor- 
ners of the faces to create quadrilateral ABCD. The symmetry of the 
dodecahedron implies that ABC'D is a square. Five more such identi- 
cal squares follow by the same technique, resulting in a cube. Thus, the 


dodecahedron contains both the icosahedron and the cube, or, poetically, 
from the cosmos we create water and earth. 

It seems esthetically appropriate to stipulate that the length of each 
edge in the cosmos (that is, each edge of the pentagonal faces of the 
dodecahedron in Figure 1.3) is 1 unit. Now it is only natural to inquire 
about the edge lengths of the cube and the icosahedron that are inscribed 
within the dodecahedron. Earth first: each edge of the inscribed cube is 
a diagonal of one of the pentagonal faces (as Figure 1.3 makes clear), 
so consider the length of AD in Figure 1.4. Diagonal AD is parallel to 


& 


B 4 C 


Figure 1.4 


side BC because the pentagon is regular!!! and so also is diagonal BE 
parallel to side C'D. (As mentioned in the Preface, all marks such as [1] 
refer to the endnotes, which fill in mathematical details.) Thus BCD M is 
a rhombus with sides of length 1. Because / D = 1, then, we know that 
AM = AD—1.Now AM = ME and ZBCD = ZAME = ZAED, 
so 


.,,. AM 1 
AAME ~ AAED, yielding 7 = -p° 
Thus 
AD-AM=1 or AD(AD-—1)=1, (1.1) 


which by completing the square!! gives us 


i= 


This number is the ratio of the cube’s edge length to that of the dodecahe- 
dron that encloses it, and as such reflects the mathematical relationship of 
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earth to the cosmos in Timaeus’s metaphor. This fact alone would make 
this number poetically intriguing, but this fact is by no means alone. 

Before we investigate this constant further, let’s name it. The sculptor 
Phideas, born one generation before Plato, is said to have made use of this 
pentagonal proportion in his works, which include the statue of Zeus at 
Olympia, one of the ancient world’s wonders. The Greek letter phi (the 
first letter of Phideas) denotes this constant: 


14+ 75 
Q= 


= 1.61803. 
2 


1.2 The golden rectangle 


Consider the icosahedron (Figure 1.5). Five equilateral triangles meet at 
each vertex, and their bases form a regular pentagon. Choose any pair of 
neighboring vertices, such as A and B in Figure 1.5, and their opposites 
D and C. Set the length of each edge to 1 unit. Both AD and BC are di- 
agonals of pentagons, and we saw in Figure 1.4 that if a regular pentagon 
has sides of length 1, then ¢ is the length of each of its diagonals. So the 
shape ABCD in the figure is a rectangle with sides proportional to each 
other in the same ratio as the diagonal and the side of a pentagon. The 
icosahedron contains 15 such golden rectangles, each one meeting two 
others at right angles. 


Figure 1.5 


Figure 1.6 


Lay rectangle ABC D flat as in Figure 1.6. Let AB = 1,so AD = y. 
Locate EF so that ABF'E is a square. From (1.1), we know that 


y(y—1)=1, which may be written — = ——. (1.2) 
~— 


But this proportion applied to the figure yields 


AD CD 

AB DE’ 
revealing that rectangles ABC'D and FCDE are similar. We can now 
apply what we just did to the original rectangle to the smaller one, with 
similar results, and this effect will continue as the process is repeated. 
The figure thus created (Figure 1.6) is a sort of spiral composed of squares, 
curling inward toward a point. This intriguing point, which we will la- 
bel w (the Greek letter omega), must lie at the intersection of diagonals 
BD and CE thanks to the repetitive similarity! of rectangles ABCD, 
F'CDE, and so on. This point has been enthusiastically called “the Eye 
of God” by some of its admirers. 


1.3 The Eye, and the arithmetic of 


As w is the fulcrum of spiraling squares inscribed on a golden rectangle 
that spans an icosahedron embedded in a dodecahedron that symbolizes 
the cosmos, it would be a shame not to spend some time thinking about 
its qualities. Along the way, we will use some interesting arithmetic prop- 
erties of y. 
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Sides AD and BC of rectangle ABCD (in Figure 1.6) are parallel, 
so ZCBD = ZBDA, and ZBwC = ZDwE as vertical angles. Thus 
we can deduce that ABwC ~ ADwE, which leads to 


Bu BC _ 


wD DE o=-1- eo 


Back at (1.1), where we first revealed y in the form of the pentagonal 
diagonal AD, we saw that 


g-p=1, or g=yHl. (1.4) 
Dividing by ¢ yields 
1 
pHtlt+ =, (1.5) 
p 


so ¢ is one greater than its reciprocal. Now (1.3) may be rewritten 


Bu 4 ~ 2 
yr. 


ep (+e We 


So the fulcrum point w cuts the diagonal of the golden rectangle into 
segments proportional to the square of y. 
How far is it from B to w? From wD = BD — Bw, we have 


B 2 
"= o Bu=BD.—~_. 
BD — Bw 1+y? 


By the Pythagorean theorem, we may replace BD with \/1 + y?, so 


2 2 
Data 23"), (1.6) 
l+y /1l+¢ 1+? 


We pause to consider *. Starting with (1.4) and repeatedly multiplying 
both sides by vy gives 


P=p+l, 

P=ay ty, (1.7) 
g=~er+¢’, 

p=yity’, 


and so on. Each power of ¢ is the sum of the two previous powers. Now 
we use this list of observations to create a new list of identities!4! in which 


each is the sum of the previous two: 


y=, 
yr =yt+l, (1.8) 
yg? = 2% +1, 
p* = 3y +2, 
yg? = 5y +3, 


and so on. Using this list, we may continue with (1.6) to find that 


yt 3¢ +2 
Bu = = ; 
1+ ¢? pt+2 


With the help of (1.8), we may multiply the numerator and denominator 
by 3 — y to convert'*! the denominator to a whole number: 


Bu =P". (1.9) 


While this is a concise expression for the distance to w from the farthest 
corner of the golden rectangle, we can do better. Focus on AAC'D in 
Figure 1.7, which depicts a regular pentagon. Earlier (see Figure 1.4), we 
concluded that if side CD = 1, then diagonal AC = y. Drop altitude 
AM to bisect CD. Its length is 


AM = /— G2? 


Ke 


, Peso Maeeeer 
G M D 


Figure 1.7 
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by the Pythagorean theorem. Simplifying, we see!®! that 


AM = /G/4(4e2 —D = Vie +3. 


Now AM is the side opposite ZAC M in AAC M, so 


tan ZACM = an = /4p +3. 


Because the pentagon is regular, we know that ZABC = 108°, and thus 
that ZAC M = 72°. Back to (1.9), then, we see that 


tan 72° 
Bu = see 7 

v5 
While many of us learned early on the exact values of trigonometric 


quantities such as cos 30° and sin 45°, not so many have gone on to dis- 
cover such delights as tan 72° = \/4y + 3. 


1.4 The Fibonacci 
(Hemachandra) sequence 


Let’s take a look at the (extended) sequence of coefficients 
1,1, 2,3,5, 8,13, 21,34, 55,... (1.10) 


on the right side of the expressions in (1.8). Each entry is the sum of 
the previous two. This sequence is named for Leonardo Pisano (Italy, 
born c. 1170), known in modern times by the nickname Fibonacci. As 
a boy, Fibonacci traveled to North Africa, where he learned the Hindu- 
Arabic numeral system, which used ten digits (much like 1, 2,3,...,9,0) 
as well as the concept of place value (so that, for example, 23 equals two 
tens and three ones). Fibonacci argued that this approach should replace 
the Roman numeral system that was in wide use in Europe. In a book on 
these topics, Fibonacci introduced the sequence (1.10) as the answer to a 
word problem. 

Perhaps less well known is the description of this sequence by Acarya 
Hemachandra (India, born c. 1088) about fifty years prior to Fibonacci’s 
publication. Hemachandra discussed the sequence in connection with his 
interest in Sanskrit poetry. Sanskrit belongs in the family of the most 
ancient languages, with origins as distant as 1500 BCE. It is a highly 
structured language; the name “Sanskrit” translates as “refined”. 


Sanskrit syllables are either Jong or short, depending on a handful of 
rules defined around vowels. The vowel a has long and short versions, for 
example, and the Romanized transliterations of Sanskrit words use the 
usual bar over a long vowel, as in dsana. When speaking Sanskrit poetry, 
one draws out a syllable with a long vowel for two beats, so Gsana is 
pronounced “aasana’, not quite as four distinct syllables, but instead as 
two beats and one beat and one beat. We can write 

21 £1 
a sa na 


to show the number of beats above each syllable. So writing 


We. Wee2 22'> 32 

ku ti 1a ka ra 
indicates that the first two syllables of the word kutilakara are each spo- 
ken with the usual one beat, but the last three syllables are drawn out 
for two beats each when the word is recited aloud during a poem. So the 
word kutilakara has five syllables but is spoken with eight beats, as in 
“kutilaakaaraa”’. 

Of course, this rhythmic consideration is of utmost interest to poets. 
A poet like Hemachandra, with a bent to mathematics, cannot help but 
ask how many ways that a line of poetry with a fixed number of syllables 
can be built of two-beats and one-beats. For example, the 16 syllables in 
the line 

2 1 2 I ad 222 25 2 By Ae: 52 1 “Dy Do 2, 

Kun da li ku ti 14 ka ra sar pa vat pa ri kir ti ta 
(which, as an aside, translates “Kundali is said to be coiled like a ser- 
pent’) are spoken with a total of 25 beats. A poet who chooses to use 
this 25 beat structure must, by the formal rules of Sanskrit poetry, use it 
throughout the poem. 

So Hemachandra asked himself: how many different 25 beat lines of 
poetry (like the one above) are possible? Listing them all would be te- 
dious. Instead, Hemachandra reasoned that every 25 beat line must begin 
with either a 1 beat syllable or a 2 beat syllable, as in 


1 + 24 more beats 
2 + 23 more beats. 


So if he knew how many 24 beat patterns and how many 23 beat patterns 
there were, then he could add the two totals to get the number of 25 beat 
patterns. Inspired by this insight, Hemachandra began with the simplest 
cases and built toward the goal of counting 25 beat patterns: 
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1 beat pattern: 1 

2 beat patterns: 11 and 2 

3 beat patterns: 111 and 12 and 21 

4 beat patterns: 1111, 112, 121, 211, 22. 
To illustrate Hemachandra’s strategy, note that if we ignore the first sym- 
bol in each of the 4 beat patterns, then what remains are exactly those 
patterns in the previous two cases (2 and 3 beats). So we can add the 
number of 2 beat patterns and 3 beat patterns to find the number of 4 beat 
patterns. In this way, Hemachandra produced the sequence 


1,2, 3,5, 8, 13, 21, 34, 55, 89, 144,... 


where the nth number in the list is the total number of n beat patterns. 
The 25th such entry is 121393, so this is the desired number of 25 beat 
patterns. 

Exercise 2 explains a shortcut that allows us to calculate the number 
of n beat patterns directly, without having to calculate the number of 
nm — 1 and n — 2 beat patterns. (Exercises are found at the end of each 
chapter.) 


1.5 A continued fraction for ~ 


The numbers in the Fibonacci sequence appear as bit players in an in- 
finitely nested fraction that equals the golden ratio. That continued frac- 
tion is 


1+ 


ieee 
and we will warm up to its infinite aspect step by step. 
If we begin to write the continued fraction, stop after one plus sign, 
and simplify, we find that 


1+ ee 
he ae? 
If we stop after two plus signs, we obtain 
1 1 3 
1 =1 = 
i 1 " 2/1 2 
t+ 


Halting at the third plus sign yields 


1 
1+- 
1 


These truncated fractions are called the convergents of the continued 
fraction. In each case, the convergent is the ratio of two consecutive 
Fibonacci numbers. 

We may confirm this link between the convergents and the Fibonacci 
numbers using a proof by induction. Here we will not state the proof for- 
mally, but will instead explore our way into the heart of it. Some notation 
will be helpful: Let F;, be the nth number in the Fibonacci sequence 


1, 1, 2, 3, 5, 8, 18, 21, 34, 55, 89, 144, ... 
and note that the Fibonacci numbers obey the rule 
Frio =FroitF, forn>1. (1.11) 
Now define 


So=1 and S,=1+ 


forn > 1. 


n-1 


This definition of S,, is recursive, meaning that we repeat the rule to 
generate a sequence of results. Here, it begins with 


1 1 2 FP; 

5 Soro ae ae 

Next, we calculate that 
1 1 3. OY 
— 1 — 1 + — d 

S2 Ss ye eam a 
1 1 5 Fs 

S. = 1 — 1 —- — — 5 

: So 5 3 By 


whereupon we state a conjecture that captures this emerging pattern: 


Fn+e2 
nq 


Sp = (1.12) 


1.5 A continued fraction for » 13 


Now if we assume that 


Fr 
Sn = (1.13) 
we can check the truth of (1.12) via 
1 bet ; 
Sy,n=1+ by our original recursive rule 
n-1 
1+ : b tion (1.13) 
a Ce our assumption (1. 
Froi/Fn , 
= Fysi + Fy 
Fysi 
Fy+e2 
=—— by rule (1.11). 
Fag y rule ( ) 


So by assuming (1.13) we can prove (1.12). Such an argument is called 

a proof by induction, and it is valid because (1.13) is a safe assumption; 
after all, we verified it for several cases when we began. 
We may now untether our imaginations and ponder 

1 
Soq'= 1 -—- ——__———_ (1.14) 
1+ 
1+ 


1 
Lars 


where the subscript oo indicates that there are infinitely many plus signs. 
Now when one steps into the infinite, one must tread carefully. After all, 
in what sense could the number of plus signs be infinite? A rigorous way 
to approach (1.14) is given in the endnotes"! but here we simply appeal 
to our intuitive grasp of the infinite. The equation 


1 
al ee (1.15) 


from (1.5) contains a curious self-reference, in that the denominator 
on the right is the very same quantity that appears on the left. We may 
thus replace y on the right with a copy of the entire right side of (1.15) 
to obtain 


1 
g=1+—., 


1+- 
g 
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whereupon we can for the same reason replace y on the right with an 
entire copy of the right side. Nothing stops us from making this move 
repeatedly, or even forever (as long as your intuition allows for a process 
to loop forever). Doing this, we conclude that 


y = 1+ ——___ (1.16) 


1+ 


Tats 
gives us a continued fraction equal to the golden ratio. 

Earlier in the section, we concluded that the convergents of this con- 
tinued fraction are ratios of consecutive Fibonacci numbers. This does 
not guarantee, however, that the convergents actually converge to a limit. 
But exercise 4 provides a lovely graphical demonstration that the limit of 
the sequence {.5;,} of convergents exists. 


1.6 wis irrational 


Now that we have encountered y and seen how it may be represented, 
we might ask what kind of a number it is. For example, we noted that yp 
is the limit of successive ratios of Fibonacci numbers 

3.5 8 13 

rae ee Se aaa 
in the previous section. Is y perhaps itself a rational number? 

Suppose it is, so y = m/n where m and n are natural numbers 
and the ratio is reduced (so 7 is the smallest appropriate whole number). 
Because 1 < y < 2, we can be sure!®! that 0 < m—1n < n. Now from 
(1.5) we have 


1 m m-n n 
=y-1l= l= », OF Y= ; 
yp n n m-—n 


We have just created a ratio for y with a smaller numerator than the orig- 
inal m/n, which we supposed to be reduced. This contradiction implies 
that we could not have hypothesized y = m/n in the first place. So vy is 
irrational. 

And now we approach the claim that y is irrational from a different 
direction. We know that 


1 
y= ae so 2p -1=V5. 
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Thus if y is rational, then so is /5. But /5 is not rational (as we will 
verify now), so y cannot be rational!®! either. 


Claim. V5 is irrational. 


As we did just above in showing that y is irrational, we will argue by con- 
tradiction. Suppose that /5 = m/n where the ratio is reduced. Squaring, 
we get 


5= uae , which can be written ies oh 

n m 

Because /5 > 2, we know that m > n, so the fraction 5n/m has a 
larger denominator than does m/n. This makes m/n a reduced version 
of 5n/m. We can therefore assume the existence of a natural number & 
such that mk = 5n and nk = m. Combining these observations yields 
nk? = 5n or k? = 5, an impossible conclusion, as 5 is not the square of 
any natural number. So /5 cannot be rational as originally assumed. 


The exercises provide a second algebraic argument (exercise 5) and an 
elegant geometric demonstration (exercise 6) of the irrationality of V/5. 
We may also use the continued fraction (1.16) to find that ¢ is irrational, 
as exercise 7 demonstrates. Finally, anyone familiar with how natural 
numbers can be written in their base three (or ternary) representation 
will find the proof in exercise 8 intriguing. 

Ever since the discovery of irrational numbers fractured the Greek 
belief that all numbers were proportions, mathematicians have sorted 
numbers into categories and hunted for numbers that defied existing cat- 
egories. Irrationality has long been a central focus of mathematics. In this 
section, and in the exercises, are a half dozen ways to prove the irrational- 
ity of the golden ratio. But if a proof is an airtight, logical demonstration 
of truth, then why prove a result in more than one way? 

Mathematicians linger on cherished topics, illuminating them from 
a variety of viewpoints, much like artists and poets try over and over to 
capture truths about the human condition. Re-proving something impor- 
tant in a new way brings joy to both the discoverer and the audience. 
Every result in this book, not just the irrationality of y, can be proved in 
more than one way. Imagine that each proof in this book is like a painting 
that one sees upon entering a gallery full of artwork, in which each work 
presents an artist’s unique vision of the same theme. 
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1.7 The arithmetic geometric 
mean inequality 


As this book opened, we saw how Timaeus used the proportions 


fire air air water 


air water water earth 


to link the four elements. Each of these proportions is of the form 


and if we stipulate that p = q+ r as well (as in Figure 1.8), we can 
calculate!®! that ¢q = yr and p = yq. Thus we say that the point that 
separates r and gq cuts the line segment in the golden mean, or we say 
that q is the golden mean of the lengths p and r. Two other well known 
proportions highlight this section, and their relationship will prove useful 
to us throughout the rest of the book. 


Figure 1.8 


The arithmetic mean of a set of n positive real numbers answers the 
question, “What single amount may we sum n times to arrive at the total 
of the n original numbers?” Let A, denote the arithmetic mean of the 
numbers aj, 2,...,@n. To calculate A,,, we split the sum equally: 


a1 + ag ++'' + an 
rs : 


An a 


The geometric mean of such a set of numbers answers an analogous 
question, but about multiplication, not addition. For example, to find the 
final cost of a $100 item that is on sale for 20% off and then an additional 
10% off, we multiply 


100(.80)(.90) = 72. 
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If we ask, “What unique percentage may we use twice in a row so that 
the $100 item again reduces to $72?”, the answer is not the arithmetic 
mean of 80% and 90%. Rather, because we seek a percent P so that 


100(.80)(.90) = 100(P)(P), 


we find that 
P = \/(.80)(.90) = 0.8485. 


This “average” of the percentages may be generalized to define the geo- 
metric mean G’,, of a1, d2,...,@n as 


Gn = (a1a9°° ce, 

The geometric mean of .80 and .90 is less than the arithmetic mean. 
Figure 1.9 shows why the geometric mean of any pair of positive num- 
bers is never larger than their arithmetic mean. Lay segments of length 
a > bend to end as the diameter of a semicircle with center M. Then 
altitude 1/4 N has length equal to the arithmetic mean (a + b) /2. Now an- 
gle EBF is a right angle because!!! it is inscribed in the semicircle, so 
both AF.AB and ABAF are similar to AE BF (and therefore to each 
other). Thus, 

a = “a or AB= Vab, 
which is the geometric mean of a and b. The figure not only shows 
that the geometric mean cannot exceed the arithmetic mean, but also re- 


N 
& 2 

ie 

aN - 
/ \ 
id \ 
Yb \ ee: ~ 

F M E 


Figure 1.9 


veals that the geometric mean equals the arithmetic mean precisely when 
a= b. 

In fact, the geometric mean of any set of positive real numbers never 
exceeds the arithmetic mean, and this relationship will support other 
proofs throughout this book. This result may be stated, 


1 

— (ay + a2 + +++ +n) > (0142 +++ an)i" (1.17) 
with equality!!?! precisely when aj = az = --: = Gn. This claim is 
called the arithmetic geometric mean inequality. The following equiv- 
alent statement allows a simpler proof: let a; = (A;)” in (1.17) and 


multiply both sides by n to obtain 


We will prove (1.18) for n = 3 by way of indicating how a full induction 
proof would proceed. First we prove a claim that will be useful in the 
main argument. 


Claim. If a, b,c, d are positive real numbers with a > b and c > d, then 
ac + bd > ad + bc. (1.19) 


In words, when the numbers c and d swap places, the total decreases. The 
short proof begins with 


(a — b)(c— d) > 0, 
which follows immediately from a > b and c > d. Distributing, we have 
a(c—d)—b(c—d)>0, andso (ac+ bd) — (ad + bc) > 0, 


from which the claim follows. Armed with this maneuver, we tackle 
(1.18) in the case where n = 3, letting x, y, z denote the three numbers 
(rather than A,, Ag, As) for the sake of clarity. 


Claim. For positive numbers x, y, z, we have 

e+y +23 > 3ayz. (1.20) 
Assume that x > y > z Gust relabel the numbers if necessary) and begin 
with the left side of (1.20). Observe that 


a+ y3 4 23 <9 4 yy 4 222 
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and mentally isolate the last two terms. Because y? > z? 


these terms fit the pattern on the left of (1.19). Thus, we can swap y and 
z and introduce the inequality from (1.19) to reach 


andy > z, 


ae + yy +22z2>03+ yz + zy. 
Again we may use (1.19), this time on the first two terms, to write 
ge + yz + zy > xet ya + zy. 


With these two moves, we have promoted a z from the original third term 
up to the first term. Two more judicious swaps bring us to 


gig ya + zy = xryz+ ay aie z*y 


= LYZ + Tye + Lyz 
= 3xryzZ, 


whereupon we have satisfied ourselves that (1.20) is true. A full induction 
proof of (1.18) may mimic this same sort of shuffling. 


1.8 Further content 


1.1 Constructing ¢» geometrically. For millennia, mathematicians have 
limited the tools with which they investigate geometry, often to just 
two, the compass (for drawing circles) and straightedge (for drawing 
lines). The construction outlined in this exercise, in which we split a 
line segment proportionally into the golden ratio, may be performed 
with these two tools. 


On a line, as in Figure 1.10, construct equilateral triangle ABC’. We 
stipulate (for convenience only) that AB has length 2. From side 
BC, build square BCDE. Locate point F' on the original line such 
that CE = CF. 

Explain first why BF = \/5 — 1. (Hint: draw a perpendicular line 
from C to AB.) Then explain why 


AP’ 
ae 

Thus AF is split by B into two lengths that are proportional to the 
golden ratio, and bisecting AF’ yields a segment of length yp. 


Figure 1.10 


2.2 Binet’s formula. Here we derive a closed formula (commonly named 
Binet’s formula, although the result had already been discovered by 
Abraham de Moivre, who we will meet again in Chapter 4) for the 
nth Fibonacci number, that is, an expression that can be evaluated 
for any whole number 7 to obtain the associated Fibonacci number. 


(a) We have seen at (1.4) that y satisfies the equation y? = y+ 1. 


Show that 
i 1- v5 
7 2 


satisfies the similar equation rT? = 7 + 1. 


(b 


wm 


We defined the Fibonacci number F;, to be the nth entry in the 
sequence 1,1, 2,3,5,8,.... With this in mind, consider how the 
argument leading to (1.8) can establish the truth of the identities 


yp” = pF, + Fn_-1 and 
T= TE, + Fy-1, 


expressing the required details in your own words. 


(c 


wm 


Subtract the bottom expression from the top and manipulate the 
result to obtain the desired result 


eG), (1.21) 


Check the formula by calculating any two Fibonacci numbers 
you wish. 
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(d) A slightly faster way to calculate F,, relies on an observation 


about (1.21). The closest integer to y”//5 equals the nth Fi- 
bonacci number; for example, 


5 
Y 24.96 5, 


~ 55.004 = 55, 


and so on. What feature of (1.21) makes this shortcut possible? 


3.3 The harmonic-geometric-arithmetic right triangle. We discussed 
the geometric mean and arithmetic mean of a set of positive numbers 


in section 1.7, concluding that their geometric mean never exceeds 
their arithmetic mean. This exercise introduces a third mean, and 
from the three we will build a triangle that features the golden ratio. 


Like the arithmetic and geometric means, the harmonic mean was 


known to Greek geometers in the time of Plato. In Timaeus (the dia- 
logue featured in the introduction to this chapter), Timaeus hypoth- 
esizes that the Creator designed the human soul in much the same 
way that we calculate the harmonic mean of two numbers. 


(a) In Timaeus’s words, the harmonic mean is “the one exceeding 


(b 


wm 


and exceeded by equal parts of its extremes (as for example 
1,4/3,2, in which the [harmonic] mean 4/3 is one-third of 1 
more than 1, and one-third of 2 less than 2).” Let’s formulate 
this concept. Denote the “extremes” by a and 6 with a < b. (So 
in Timaeus’s example a = 1 and b = 2.) Let x be the “part” of 
each extreme that Timaeus references. Then the harmonic mean 
HAT of a and 0b satisfies both 


H=a+ax and H=b-—bdz. 


Set these equal and solve for x. Then, setting H equal to either 
expression above, conclude that 
2ab 
H= ; 1.22 
a+b ee) 
Check (using algebra) that the square of the geometric mean G 
of a and 6 equals the product of their arithmetic mean A and 


their harmonic mean. That is, show that 


G? = AH. (1.23) 
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4.4 


(c) 


(d 


wm 


(e) 


(f) 


In section 1.7, we used Figure 1.9 to argue that the geometric 
mean G of two positive numbers is always less than or equal to 
their arithmetic mean A. Draw segment BM in Figure 1.9, then 
draw a segment perpendicular to BM from point A to a point 
we will call C on BM. Using ABCA ~ ABAM, argue that 
BC equals the harmonic mean (1.22) of a and b. 


Explain how the figure thus implies that 
H<G 


and give the conditions on which these two means are equal. 

So H < G < A (fora pair of positive numbers). Suppose that 
there is a right triangle with these three side lengths. Then A (be- 
cause it exceeds H and G) must be the length of the hypotenuse. 
By the Pythagorean theorem, we have 


A? = G? + H”. 
Use (1.23) to prove that 


—=y, 1.24 
aa (1.24) 
And so the golden ratio arises in the unique right triangle that 
features three of the most important means. Now as the dessert 
course, use (1.24) to show that a/b = yp. 


The continued fraction for y converges (via graphing). In section 
1.5, we created the continued fraction 


1+ 


eee 


but did not prove that its convergents approach a limit. It so happens 


that continued fractions with all numerators equal to 1 (these are 


called simple continued fractions) have this convergence property. 


Here we will not prove this general result, but instead will see a 


graphical demonstration of the convergence of the continued fraction 


for y. 
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Recall from section 1.5 that we defined the convergents 


So=1 and S,=1+ forn>1 
n-1 
so that 
2 3 5 
Si} 7% 5? 93 3? 94 5? and so on, 


and then proved that they are indeed ratios of consecutive Fibonacci 
numbers. Now let’s turn to Figure 1.11, which shows the curves y = 
x and z = 1+ 1/2 and their intersection at the point (y, y). 


(a) Explain why the curves intersect at (y, y). 
(b) Substitute S_ = 3/2 (marked on the horizontal axis at A) into z 
to find that 


AB= AG) = 28/2) = ° an 


Figure 1.11 
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Move horizontally from B to intersect y = x at C’. Explain why 
point C’ is ($3, $3). 

(c) Point D indicates where x = S3. Mimic the first substitution 
z(S9), this time with Ss, to explain why DE = S4. 

(d) Fully explain now how we can continue following this path, 
which is a sort of rectangular spiral, to the point (y, vy). 


V5 is irrational (via algebra). In section 1.6, we saw a proof by 
contradiction that proves the irrationality of \/5. Here is an alterna- 
tive. 


Note that 2 < //5 < 3, and let a = /5 — 2. If a is irrational then 
so is 5. Suppose, though, that a is rational, so a = p/q where the 
ratio is reduced (so q is the smallest appropriate whole number). 


(a) Multiply a by (v5 + 2)/(V5 + 2) and explain why 
1 p 1 


———, so a : 
at 4’ q at4 


(b) Solve for a and use the solution to find a contradiction to how 
we defined q. 


V5 is irrational (via geometry). We show that y is irrational by 
way of proving that \/5 is as well. In exercise 5, we saw an algebraic 
proof. Here we sidestep algebra with the help of geometry. 


Right triangle ABC in Figure 1.12 has sides 1, 2, and /5. Point D 
sits where CD = CB = 1. We draw ED perpendicular to AC at 
D, making ED = EB. 


(a) Why is AABC ~ AADE? 
(b) Deduce that 
AC AE 
ME Gam oes 

In conclusion: Suppose that /5 = AC /CB is rational. Then 
AE/DE itself is both rational and a reduced version of AC/CB. 
But then we can reapply the argument with AADFE in the role 
of AABC, finding another version of AC/CB that is yet fur- 
ther reduced. Because fractions may not be reduced infinitely, 
we will have reached a contradiction. 
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7.7 ¢ is irrational (via its continued fraction). In section 1.6, we saw 
that y is irrational by proving that \/5 is irrational. Here we prove 
the same truth but by analyzing the continued fraction 


yp=14 (1.25) 


1+ 
eee 


that we first described at (1.16). As in section 1.6, we define S;, to 


be the convergent of (1.25) that is truncated after the nth plus sign. 
So 


Let F;, be the Ath Fibonacci number from the sequence 1, 1, 2,3, 
5,8,... that arises from the property F,42 = Fri. + Fy. Recall 
that we showed at (1.12) that 

Fn+e 
. Fusi 


(1.26) 


(a) The square of any Fibonacci number differs from the product of 
the numbers on either side of it by one. For example, 


F\F3-(F))?=1, FR, - (F3)? =-1, 


F3 Fs = (F;)? = 1, etc. 
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The conjecture 
FyaFiu — (RY = (-1)" (1.27) 
therefore seems viable. Prove (1.27) by induction. 
(b) With (1.26) and (1.27) in hand, explain why the difference be- 
tween consecutive convergents S,,4; and S', is 
(—1)” 


Sy Sn = ; 
a Fy4oF gi 


(1.28) 


(c) From (1.28), deduce that 
p € (Sox, Son41) for all k. (1.29) 
(d) Put (1.28) and (1.29) together to explain why 


lp — S| < for all k. (1.30) 


1 
Fy Pepi 
(e) Now we can prove that ¢ is irrational. Suppose, on the contrary, 
that y is rational and may therefore be written yp = m/n for 
some whole numbers m and n. Now if we substitute m/n for 
in (1.30), we obtain 


| m 
nm 


gS | eZ 
. Py Peat 


Use (1.26) and the fact that the right side of the above is less 
than 1 to find a contradiction. (Once you do this, you will have 
demonstrated that y cannot be written as m/n, implying that 
is irrational.) 


¢ is irrational (via ternary arithmetic). This proof requires knowl- 
edge of modular arithmetic and base three (ternary) representation of 
numbers. But anyone armed with such knowledge should find this 
proof appealing. 


Claim. \/5 is irrational. 


First, we argue that every perfect square has 1 as its final nonzero 
digit when represented in base three. For any natural number q we 
must have 


q=1 (mod 3) or 
q=2 (mod 3) or 


q=0 (mod 8). 
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We study each case in turn. 


If g = 1 (mod 3) then multiplying each side of the congruence by 
q yields 


g=q (mod3) or g?=1 (mod 3). 
So if q in base three has 1 as its final digit, then q? does as well. 


(a) In like manner, argue that gq = 2 (mod 3) implies q? = 1 
(mod 3). 

(b) The final case requires more thought. If g = 0 (mod 3) then 

we know that q is divisible by some power of 3. Let k be the 

largest natural number such that 3* divides q and let r denote the 

remaining factor, so g = 3*r. Now because r is not divisible by 

3, we know that r is congruent to either 1 or 2, and the previous 

cases apply with r in the role of q. 

Put these ideas together to conclude that q? = 1 (mod 8). 

Now on to the proof that V5 is irrational. Suppose v5 is rational, 

so V5 = m/n. Then 5n? = m?. Show that this equation is 

inconsistent. 


(c 


wm 


CHAPTER 2 


Tv 


Bees have populated the earth for tens of millions of years, about as long 
as the land has supported flowering plants. Honey bees gather nectar from 
flowers and produce honey as a food source and beeswax as a storage 
material. They create honeycombs from the wax, a process streamlined 
by natural selection to a mathematically perfect solution. 


To create the hexagonal tiling that is characteristic of a honeycomb, 
a bee secretes wax and fashions it into a cylinder around her body. She 
softens the walls with her own heat and, when she is finished, the surface 
tension acting on the cooling walls pulls them straight. Because the most 
efficient way to pack circles together arranges the circles into a hexagonal 
grid, the resulting polygons are six-sided. 


Only three regular polygons can tile a surface, and among these three, 
the hexagon more efficiently encloses storage area (e.g., for honey) than 
the equilateral triangle or square. To illustrate this point, consider an 
equilateral triangle with perimeter 6 centimeters. Each side is of length 
2 cm, so by the Pythagorean theorem, the triangle’s height is /3 cm. 
The triangle thus has area (1/2)(2)(\/3) ~ 1.73 square centimeters. A 
square with perimeter 6 cm has sides of length 1.5 cm, so it encloses 
(1.5)? = 2.25 square centimeters of area, making it more efficient than 
the equilateral triangle. By contrast, a hexagon with perimeter 6 cm can 
store even more honey. Each side has length 1 cm, and we can cut the 
hexagon into six equilateral triangles as in Figure 2.1. The area of each 
triangle is (1/2)(1)(V3/2), so the area of the hexagon is 63/4 ~ 2.60 
square centimeters. 
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Figure 2.1 


2.1 Liu Hui approximates 
7 using polygons 


Hexagons are but a byproduct of the process of making honeycombs; 
bees, in fact, exhibit the finest efficiency by crafting circular chambers. 
The mathematical community grasped the import of circles early, includ- 
ing the hunch that circles optimize storage area. In the third century, Liu 
Hui (China) explained how one might explore this unique feature of cir- 
cles by repeatedly doubling the sides of a polygon, creating a sequence 
of polygons that ever more closely resembled a circle. (Five hundred 
years earlier, Greek mathematician Archimedes captured the value of 7 
in much the same way.) 

Liu Hui’s strategy may be phrased thus: inscribe a hexagon in a unit 
circle and find both the area and the side length of the hexagon. Use this 
information to find the area and side length of a 12-gon inscribed in the 
same circle, then repeat to reach a 24-gon, then a 48-gon, and so on. Each 
polygon has double the number of sides as the last, so the sequence of 
areas will grow ever closer to the exact area of the unit circle, which has 
area 7 square units. 

This geometric approximation of 7 iterates fairly easily thanks to a 
formula that calculates the side length of a 2n-gon given the side length 
of an n-gon. In his solution, Liu Hui does not use trigonometry, which 
had yet to be developed in China. In Figure 2.2, the circle with center Q 
has radius 1 and therefore area equal to 7 square units. Inscribed in the 
circle sits an n-gon (a hexagon, for example) with just one of its sides 
AB shown. We bisect AB at M and draw a line from Q through M to 


2.1 Liu Hui approximates z using polygons 31 


Figure 2.2 


locate a new vertex C' for our 2n-gon. (In the same way, we would bisect 
every other side of the n-gon.) 

We wish to find the area of the 2n-gon in terms of the n-gon’s fea- 
tures. If we let AB = a, and note that QC is perpendicular to AB, then 
we can calculate that the area of AQ AC (by using its altitude AM = a/2 
and base QC' = 1) is a/4. Because 2n such triangles tile the 2n-gon as a 
whole, we conclude 


area of the 2n-gon = ae . (2.1) 


So from a and n (the side length and number of sides of our n-gon), 
we can find the area of the 2n-gon. If we wish to iterate this process, 
however, then we need to calculate the side length AC = 6 of the 2n- 
gon as well. 

We may connect a from the n-gon to b from the 2n-gon by means of 


the Pythagorean theorem, starting with 
a\2 
(qu) +(5) <0 


from AQM A. Combining"! this observation with QC = QM+MC = 


1 yields 
1 
MC =1- 54-0. (2.2) 
Applying this result to AAMC (and simplifying!'*!), we find that 
b? =2-—V/4-a? (2.3) 


as desired. 
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Let’s return to the hexagon with sides of length 1 and area 2.60 from 
earlier in this section, and use its information to show (2.1) and (2.3) at 
work. The hexagon’s six sides make n = 6 and we have a = | as the 
length of each side, so from (2.1) we know that the area of the associated 
dodecagon (that is, 12-gon) is 3. Now if we wish to again double the 
number of sides of our polygon to more closely approximate the area of 
a circle, we can use (2.3) to obtain 


b= V2 V4 P= 2 V3 (2.4) 


as the length of one side of the 12-gon. If we wished to repeat the pro- 
cess!!! to find the area and side length of a 24-gon, we could do so, and 
so on for as long as we wish. Each polygon’s area would approximate 7 
even more closely than the area of the previous polygon. 


This approach allows us to approximate the area of a circle with ra- 
dius 1 as closely as we wish. Liu Hui doubled the sides of his original 
hexagon five times to determine that the area of a 192-gon is a bit more 
than 3.1410. He also repeated the analysis, this time with the unit circle 
inscribed within his polygons, to obtain an upper bound of 3.1427. 

About fourteen centuries later, the scientific community adopted the 
Greek letter 7 to represent this value that Liu Hui had ensnared with his 
clever geometry. During those many years, other minds joined the search 
for a closer approximation of Liu Hui’s 


3.1410 < m < 3.1427. 


For a long while, thinkers used versions of Liu Hui’s idea. In the 1400s, 
for example, Jamshid al-Kashi (Persia) found the equivalent of 17 digits 
of 7 by calculating the perimeter of a polygon having 3 x 278 
And while these ingenious and persistent efforts bear mention, our focus 
now turns away from this tale of polygons to consider an approximation 
strategy with a wholly different genesis. 


sides. 


2.2 Nilakantha’s arctangent series 


One of the earliest alternatives to using polygons to approximate 7 origi- 
nated in the work of Nilakantha Somayaji (India, born 1444). In part in- 
spired by his interest in astronomy, Nilakantha discovered what we now 
call infinite series for trigonometric functions. His series for the inverse 
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tangent function arctan z, in particular, leads to a wholly new method 
for approximating 7. 

Nilakantha’s development of the series for arctan x predated any 
deep understanding of integration by two centuries, so the rest of this 
section, while straightforward, does not follow his path. 

We take the geometric series 

1 2 3 4 
fag + UP 4+ U5 ++: (2.5) 
as our starting point. The series converges!'® for x € (—1,1). If we 
replace x with —x”, we establish the result 


1 


fae Oe ee Se ee (2.6) 


as a lemma for what follows. 
Consider y = arctan & and its equivalent identity x = tan y. Implicit 
differentiation with respect to x yields 


dy 
1 = sec? y+ 
d dx 
and, using the identity sec? y = 1 + tan? y, 


dy _ 1 
dx 1+tan?y~ 


Because x = tan y, we have 


d d 
i oe arctan © = 


dx dz 1+ 2? 


and can apply our lemma (2.6) to the right side, then integrate both sides, 
obtaining 


y= f Q-at+ot-a9 +28 —.--) dx 


or 


arctanz = x . + + vee (2.7) 


as promised. 
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As with series (2.6) that triggered this approach, series (2.7) con- 
verges on the interval « € (—1, 1). Happily, the series also converges for 
x = 1, which produces 


paisa tg (2.8) 


from (2.7). This formula for 7 was not only discovered by Nilakantha 
but also independently by James Gregory (England, born 1638) and 
Gottfried Leibniz (Germany, born 1646), among others. 

Now while formula (2.8) elegantly expresses 7 as an easily deci- 
phered pattern, it approximates 7 at a crawl. Achieving 7 * 3.1417 re- 
quires about ten thousand terms. In response to this inefficiency, 
Nilakantha examined the partial sums of (2.8), and their errors, to craft 
the far more efficient 


at A 4 4 
4 (EAL SAS as : 


which with just five terms trumps the first ten thousand of (2.8). While 
we will not plunge into the details of this result here, it bears mentioning 
as testimony to the power of pre-calculus mathematics and the ingenuity 
of those who mastered it. 


2.3  Machin’s arctangent formula 


A few decades after James Gregory rediscovered Nilakantha’s formula 
(2.8), John Machin (England, born 1686) found another way to coax the 
arctangent function into approximating 7. In just three terms, Machin’s 
formula reaches 7 ~ 3.1417, thus generating digits of 7 much more 
quickly than Nilakantha’s (2.8). After we follow Machin’s path to his 
formula, we will return to this amazing fact. 

In Figure 2.3, the unit circle with center Q has radius QA and tan- 
gent AF, making 7Q AE aright angle. We place B so that 7AQB mea- 
sures 7/4 radians. As Machin himself did, we observe that tan7/16 ~ 
0.1989 =~ 1/5. Now we locate point C' such that AC = 1/5. Let a de- 
note ZAQC. Because the circle has radius 1, we know that tana = 1/5. 
This means that a is slightly larger than 7/16. 

Now double a to locate point D. By the trigonometric identity 

tanA+tanB 


tan(A+ B) = 2. 
ont ) 1—tanAtanB’ Go 
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we calculate!!! that tan (2a) = tan (a + a) = 5/12. A second applica- 
tion of the identity returns tan (4a) = 120/119, which is slightly larger 
than 1. Note that therefore AF = 120/119. 

Applying (2.9) to 


AE = tan (1/4+ ZBQE), 


we obtain 
120 tan (m/4) + tan (ZBQE) 


119 1-—tan (7/4) tan(ZBQE) 


Solving!!*! for tan ZBQE yields tan(ZBQE) = 1/239. Finally, be- 
cause 


“ = 4a — ZBQE, 
we have Machin’s formula 
‘ = Aarctan (1/5) — arctan (1/239). (2.10) 


Let’s return to the claim, made at the beginning of this section, of the 
efficiency of Machin’s formula. Using just the first three terms of (2.7) 
to calculate 


arctan 1/5 = 1/5 


Ds _ 4/5)? ~ 0.197397, 


5 
(1/239)3 — (1/239)5 


arctan 1/239 ~ 1/239 3 + 0.004184, 
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we find that 


m & 4(4(0.197397) — 0.004184) = 3.141621, 


which differs from 7 by only about 0.00003. 
Intrigued by this approach, other scholars since Machin have contin- 
ued to create formulas similar to his, including 
1 1 1 1 
“ = 44 arctan 57 +7 arctan 339 12 arctan 682 +24 arctan 72943’ 
which was used in 2002 (about a century after its discovery) to set a 
(then) world record for calculating digits of 7. In exercise 1, we will 


see a derivation of a formula similar to Machin’s that uses only basic 
geometry. 


2.4 Wallis’s formula for z /2 (via calculus) 


The area of a quarter of a unit circle is 7/4, and with today’s calculus!'?! 


we find it via the integral 


1 
| (1 — 2)/? dex. 
0 


When John Wallis (England, born 1616) turned his attention to 7, inte- 
gral notation had yet to be invented, such was the infancy of calculus. 

In 1655, Wallis discovered a remarkable formula that could compute 
the digits of 7 in a straightforward way. His formula, which segregates 
the even and odd natural numbers, is 

wr 2 2 4 4 6 = 6 


= -X=-=X=Xs>K 5K oXKeee (2.11) 
2 1 3 3 5 5 7 


Wallis’s original derivation rests only on algebra, geometry, and intu- 
itive pattern seeking. His efforts appear not here but in Appendix A, sim- 
ply due to the sheer amount of work involved. (Exercise 2 gives a mod- 
ern proof that uses a similarly elementary approach.) Here we follow a 
shorter path as provided by calculus via the family of integrals 


n/2 
“on 
an =f sin” x dx. 
0 
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The first two integrals evaluate to 


n/2 n/2 rT 
a= | sin? 2 de = [ 1ldx=— and 
0 0 2 


n/2 n/2 
a= | sinx dx = [ - cos | =1 
) 


(0) 


To calculate 


n/2 
a2 = ip sin?2 dx weuse sin? 2 = 5 
0 


to get 


4 i 


*/2 (1 — cos(2z) 7 
=> d. . 
= | D 4 


To calculate a3, we split sin? x as in 


n/2 n/2 
a3 = | sin? ¢ dx = | sin? x sin x dx 
0 0 


and then apply integration by parts with 
u =sin? x so du=2sinzxcosz dz, 
dv=sinxdx so v =—cos2, 
which leads to 
2 m/2 n/2 
a3 = | — cosrsin | +2 / sin x cos’ x dx. 
0 0 

The first expression equals 0, and we use the identity 

cos? g = 1—sin?z 


to write 
m/2 
= ‘ 28 
a3 = 2 | sin x(1— sin* x) dz, 
0 


1 — cos (22) 


or 
a3 


m/2 n/2 
= if sin x dx — | sin? x dx. 
2 0 ) 


The first integral is a; and we calculated that a; = 1 above. The second 


is a3, SO 

a3 2 

gt OS or 03 = 3° 
This process hints at a recursion, so we apply these moves to the general 
case, namely 


m/2 m/2 
ee = Peet ‘: 
An = i sin” xdx or Q@n= ‘ sin xsinax dz. 
ty) 0 


Integration by parts with 
u=sin"-'x so du=(n—1)sin"* xcosz dz, 
dv=sinxdx so v =~—cos@, 


leads to 
m/2 
<n 2 a 
ay = [—cosa sin” 1 gr! +(n- yf sin”? x cos” x dz. 
0 


As before, we find that the first term equals 0. We can replace cos? x in 
the integral by using the identity cos? 2 = 1 — sin? x, then distribute to 
split the integral, obtaining 


ai n/2 n/2 
t= sin”? x dx — sin” x dx. 
n— 1 0 0 


Now the first integral is just a,,_2 by definition, and the second is ay, so 


An n—-1 
=4n-2—4n, OF An = 
n—-1 n 


Recursion (2.12) yields 


Be (2.12) 


TT 
a 5. aj =1 
1 sea 2 
ag = 3 x 2 ag = 3 x1 (2.13) 
3 log 4 — 2 1 
Was a 5 Ws ex a x 
5 3 1 fF 6 4 2 
ag=—X-xX=xX= a7==xX=-x-xil 
6 4 2 2 pe 3 
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and so on, leading to the general terms 


2 2B Ey Ae Lowa 
oc ae) ea ae rae 
2n 2n —2 2 
sce eg og a Fee eee 8 ae 2.14 
Ranh = pera) Saeeg Gi) 


Now we will show that the limiting ratio of these two general terms 
equals 1. The function sin x is bounded by 0 and 1 on the interval [0, 7/2], 
so we know!" that 


in??? 2n+1 


s x < sin x < sin?” zg. 


Integrating each from 0 to 7/2 gives 
Gan+2 S Gan41 S An. 


But with (2.12) we can replace a2,,+42 to obtain 


2n—-—1 


Gan S Aant1 S< Gon, 


whereupon we divide by az,, and then take the limit throughout: 


n—-1 a 
2 line 


noo 2n n+00 Gon 
Because the left limit equals 1, we conclude that 


: G2n4+1 : : 
lim @"*+=1 or lim Gdont1 = lim aan. 
N00 Gan n—-0o noo 


The symbol =< commonly denotes that two expressions are approaching 
each other in their limits, so we may write 


Q2n ~ 42n+1; 


or, from (2.14), 


anal © Pie, Ss dag 2n Wee ae ye 
2n Qn —2 92 In On 1 5: 
Solving for 7/2, we find that 
2 2 4 4 6 2 2 
oes _ (2.15) 


ge ha ae > oR 


Now letting n — oo, we have captured Wallis’s formula. 
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2.5 Aconnection to probability 


With Wallis’s formula fresh in our minds, we will observe a connection 
between it and a simple probability question with an answer that plays a 
role in deeper results. The probability question is: 


If a fair coin is flipped 2n times, what is the probability P2,, 
that the coin comes up heads and tails an equal number of 
times? 


To calculate this probability, we divide the number of outcomes that re- 
sult in n heads and n tails by the total number of possible outcomes. 
(The endnotes?" provide a numerical example, if needed.) Because the 
n heads might appear as any of the 2n flips, there are 


2n\ (2n)! (2)! 

nj} ni(Qn—n)! nin! 
outcomes with n heads (and therefore n tails). We divide this quantity by 
the 2?” possible outcomes to conclude that 


Bi pis (2.16) 


nin! ~ nin!27 2" (nl2”)? 


Because!”?] n! 2" = (2 x 4 x 6 x --- x 2n), we can write (2.16) as 


(2n)! 


Pon = : 
: (2x 4x 6x +++ 2n)? 


(2.17) 


Now we will use Wallis’s formula in (2.17). First, we rewrite!?*! (2.15): 


|| QRaK enya 


x : 2.18 
2 3x5x-++x(2n—1)] 2n+1 ee) 


If we multiply the numerator and denominator of the fraction inside the 
brackets by the numerator itself, as in 


2x4x---x(2n) 2x4x---x (2n) 
3x5x-+-x(2n-—1) 2x4x---x (2n)’ 


then we may write (2.18) as 


mn [(2x4x6x-+x2n)?] 1 


2 (2n)! QIn+1" 


2.6 Wallis’s formula for 7/2 via (sin x)/x 41 


Using (2.17), we get 


which simplifies to 


Pan X< (2.19) 


a(n + 1/2) 
Thus, as we flip a fair coin, the probability of flipping exactly as many 
heads as tails is approximated by the expression containing 7 on the right. 
As we might expect, this probability vanishes to zero as the number of 
flips grows arbitrarily large. 

In section 3.7, we will use this approximation to answer a less spe- 
cific, and therefore deeper, question. 


2.6 Wallis’s formula for z /2 via (sin x) /x 


Eighty years after Wallis published his formula 


Te ee es les 
alan ae) tan ae) ae a ; 
the Swiss mathematician Leonhard Euler (Switzerland, born 1707) dis- 


covered the infinite product 


a = (1 ae | ae rae | me ae oe 


and, by substituting 7/2 for 2, rediscovered!**! Wallis’s formula. 
Euler arrived at (2.20) via a risky hypothesis about the zeros of the 
curve y = sin x. To prepare for his hypothesis, we recall how the zeros 


of a polynomial relate to its factors. To take an example, the polynomial 
32° — 21a? +422 —24 factorsas 3(x—1)(x —2)(x —4), 
so its zeros are the solutions to 
3(a — 1)(a — 2)(a — 4) =0, 


which clearly are x = 1,2, 4. Changing the constant 3 to another con- 
stant has no effect on the zeros. Reversing this process, we can build a 
polynomial that has zeros at (for example) « = —1,0,3,5 by writing 


A(z + 1)(a — 0)(a — 3)(a — 5) =0 


and then choosing any constant A. 


42 2a 


Euler’s risky hypothesis extends this idea to the curve y = sinz, 
which has infinitely many zeros (at all of the multiples of 7, including 
0). So Euler guessed that he could write 


sina = A(x — 0)(a — 7) (a + 7) (a — 277) (a + 27) (a — 37) +--+ (2.21) 


in a sort of infinite factored form. (This audacity, which is a hallmark 
of Euler’s scholarship, often characterizes the work of people who we 
recognize as geniuses.) The constant A can be determined by dividing 
through by « and taking the limit of the result, as in 


= lim A(@@ — 7) (a + 2) (a Qn) (a + 27)(a — 37)(a# + 37)---. 


The limit on the left equals 1. On the right, we may directly substitute 0 
for x. Thus we reach 


1 = A(—1)(m)(—2n)(2m)(—3m)(3) --- 


and can solve for A. Substituting this solution into (2.21) leads to!?>! the 
desired identity 


M(H gE) Om 


mentioned at the opening of this section. This infinite product is the gen- 
esis of many intriguing corollaries, not just Wallis’s formula. Exercise 
3, for example, explains how we can convert an infinite product such as 
(2.22) to a continued fraction. Applying the conversion to (2.22) yields 
the lovely 


1+ 
1/2+ 
1/3 + 
1/4+--: 
as Exercise 4 explains. 


2.7 The generalized binomial theorem 


Throughout this chapter we will require a result called the generalized 
binomial theorem that is due to Isaac Newton (England, born 1642). 
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His original derivation appears in Appendix B. The theorem provides an 
expansion for expressions of the form 


(a +b)” 


where 7 is not restricted to whole numbers. 


The theorem states that we may expand (a + b)” using 
(a+b)" =a"+a10"" 10+ a2 a" 7B? +0307-7 BF + +> (2:23) 


where the coefficient a,,, equals the product of the first m terms of 


nn-l n-2 n-38 


a co eee 


For example, to expand (a + b)? (where n = 3), we first find that 


3 
ee a | =3, 
3-2 
ae =3, 
iy op 
ee er 
S200. 
re yO a 


and so on, yielding 
(a+ b)? = a? + 3a7b + 3ab? +b? 


as expected. When n is a natural number, the coefficients a, are the 
binomial coefficients 


Newton argued that this same approach works for exponents that are 
not whole numbers. For example, to expand (a+ )!/? (where n = 1/2), 


we find that 
a= 12 = 


and so on, so 
(atb)¥? a q24 oP b q73/2 b2 | q7?/2 b3 5q7 7/2 pA 
2 8 16 128 


We can similarly calculate that 


2 5a@2 —-10a* 22° 
1 1/3 = 1 x x 
Sa 3. 9° Br 243 7 79 


Later, we will want to expand expressions such as (1 — x)" where the 


exponent is a variable. To do this, we substitute a = 1 and b = —z into 
(2.23) while leaving each n untouched. Thus the expansion is 


er aig)" = 1" 4 + (1)""\(-2) a ; “ 1 qyn-2(—2)? Ae 
hy Mee) n(n—1(n—2) 03 | 
1 1-2 12.3 


This brief training in the generalized binomial theorem is no substitute 
for a proof. Knowing this, Newton tested his theory in various ways be- 
fore releasing it to the world. Appendix B describes his original discov- 
ery. 


2.8 Euler’s (1/2)! = 7/2 


Sometimes a mathematician sets out to express a constant such as 7 in 


an intentionally elegant way, perhaps as a continued fraction or as an 
infinite product. Occasionally a problem that initially seems unrelated to 
a specific constant reveals itself to be a path directly to it. Euler’s search 
for a general expression that would yield the partial sums of 


11+ 2!+3!+414+514--- 
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stands as a classic example of such serendipity. 
The partial sums of the series are 


1, 3, 9, 33, 1538, etc. 


Euler set out to find a formula that would generate any entry of this se- 
quence. In this search, he found the family of integrals 


1 
| x™(1— 2)” da 
0 
particularly useful. To evaluate this family of integrals, we may expand 


aoe n n(n—1) 5 n(n—-1)(n-2) 5 
(l-a)"=1 72+ 12 2 oe x + 
by using the generalized binomial theorem (2.23). We multiply by x” to 
get 


m i= n _ Mm m+1 
xz™(1— 2) x x + 7) 
n(n — V(r 2) ms 
1-2-3 7 
and then integrate: 
rt ngmt2 n(n — 1)a™*8 


eee — 2)" dx 


As a definite integral, this becomes 


~m+l im+2)' 1-2(m+3) 


1 n n(n — 1) 
= + 
m+1 l1(m+2)  1-2(m+3) 


1 
| x” (1—-2)" dx see. (2.24) 
0 


Let’s work on rewriting the right side. If we successively evaluate the 
right side at n = 0,1,2,3,... and simplify, we find!?®! that 


1 
n= 0 produces | 
=1 produces d 
Nee (m+ 1(m +2) 
1-2 
=2 d 
n produces me DGn +O eS) 


n= 3 produces 


(m+ 1)(m + 2)(m + 3)(m + 4) 
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and so on. In general (Euler wrote), the nth term appears to produce 


n! 
(m+1)(m + 2)(m+3)---(m+n)(m+n+4+]1) © 


We can simplify this a bit if we multiply both numerator and denominator 
by m! to get 
m!n! 
(m+n+1)!° 
Now recall that (2.25) simplified the right side of (2.24), so we can write 


(2.25) 


m!n! 


1 
i FEE SOE Gaeaacriytt 


Something wonderful happens when we specifically choose m = 1/2 
and n = 1/2 in this identity. Substituting, we get 


* We _ 2 dy = —{1/2)!41/2)! 
[ a mary cee 


which may be written 


2f Va — x? dx = [(1/2)!]. (2.26) 


The factorial on the right is quirky. Recall that we define n! by 
nl! =n x (n—1) x (n—2)x-+-x3xK2x1 


when n is a natural number. The unusual number (1/2)! defies this def- 
inition. But as we will now see, the integral on the left of (2.26) equals 
a real number, allowing us to calculate (1/2)! despite its strange nature. 
The integral 


1 
2 | Vax— x? dx (2.27) 
0 


on the left of (2.26) may be evaluated by trigonometric substitution, but 
it is much simpler to recognize the integrand x — x? as representing a 
semicircle, as follows. Let 


y=Vae-—2? of yr =a—2? 
and shuffle the terms to get 


g?—ae+y’ =0. 


2.9 The Basel problem: > 1/k? = 1?/6 47 


By adding 1/4 to each side, we complete the square, so 
geome tl/4+y2=1/4 or (¢-—1/2)? +9? = (1/2). 


This is the equation of a circle with center (1/2, 0) and radius 1/2. Now 
the limits in (2.27) are x = 0 and x = 1, which perfectly capture the 
upper half of this circle. The area of a semicircle is (1/2) mr? so with 
r = 1/2 we can conclude from (2.26) that 


1 
2 | Va — x? dx = 2- (1/2) x(1/2)? = “ . 
0 
Returning with this result to (2.26), we see that 
(1/2)! = wi ~ 0.88623. 


Euler did not content himself with this peculiar result, but we will depart 
from him here, now that the constant 7 has manifested itself in the pursuit 
of a seemingly unrelated inquiry. 


2.9 The Basel problem: 5° 1/k? = 77/6 


In 1644, mathematician Pietro Mengoli (Italy, born 1625) posed a ques- 
tion that he could not answer: what is the sum of the reciprocals of the 
perfect squares? In symbols, Mengoli asked to what did the series 


es Cees MO 
tort at ae (2.28) 


converge. Almost a century passed before Euler solved this puzzle. One 
of his solutions uses trigonometric substitution and the power series for 
arcsin x, and it testifies not only to the creative power of calculus but 
also to Euler’s ingenuity. We begin with the definite integral 


T arcsin x 
0 Vl—2? 


and evaluate it using the substitution 


dx (2.29) 


x=sin@, so dx=cosédé and arcsinz = 0. (2.30) 


The identity cos 9 = \/1 — sin? 0 gives us 


cos@ = V/1— 2? 


48 on 


and the limits x = 0 and x = 1 become 0 = 0 and 0 = 77/2, so we may 
write (2.29) as 


arcsin £ */? cos dO _ 6? 7/2 9? (2.31) 
faz 5 gosf 2 |g 8" ; 


Now we evaluate the original integral (2.29) a second way, by replac- 
ing?” arcsin x with its power series 


arcsin x = x2+ x 4 x 4 gi pees 


2-3 


to reach 


1 gdaz Xe 1 tg dx a 1-3 vi: x? dx 
9 Vl—2 2-3 o Vl—-2x? 2-4-5 1 


Each of these integrals is of the form 


1 gant dx 
b] 
0 Vl—x2 


and via the substitution (2.30) from earlier, we may rewrite them as 
n/2 
| (sin?! 0) dé. (2.33) 
0 


They belong to the family of integrals that we used in section 2.4 to 
prove Wallis’s formula. Specifically, our integrals here in (2.33) are the 
odd members of that family, and are evaluated in the right column of 
(2.13). Using those results to simplify (2.32), we obtain 


et 2 no hee A 23 BES 6 4 2 
6g 80° Podeb. 5. 3 Dano, FF GB 3B 


or, simplifying and then linking the result to (2.31), we have 


1 
= statatat- (2.34) 


This answers the question, “What is the sum of the reciprocals of the odd 
perfect squares?” Not quite Mengoli’s query, but close enough that with 
a cute trick we can bridge the gap. Let S denote the sum of Mengoli’s 
series (2.28), so 
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or, sorting odds from evens, 


of a 1 1 1 1 1 
S= ptaetpt + ptptgt é 
By (2.34) we know that the expression in the first set of brackets equals 


‘ia /8, and, factoring 1/4 from each term in the second set of brackets, 
we have 


12 325? 4\ 1? 2? 3? 
or simply 
wd 
SSS pes. 
8 ¥ 4 
Solving for S yields 
2 
T 
fo ares 2.35 
6 (2.35) 


and Euler’s solution to Mengoli’s vexing puzzle is complete. 

Before Euler solved the problem, Jacob Bernoulli (Switzerland, born 
1655) published some partial results in Basel, Switzerland, and thus “the 
Basel problem” became its nickname. Coincidentally, Basel was Euler’s 
birthplace. In exercise 3, we will see how complex numbers allow for a 
more streamlined solution to the Basel problem. 


2.10 7 is irrational 


In section 1.6, we classified the golden ratio ¢ as an irrational number 
(by several different arguments, including a proof that 5 is irrational). 
The number 7 is likewise irrational, as we will see in a moment. As an 
aside, we note that although both y and 7a are irrational, we say that 7 
is transcendental while ¢ is not. A transcendental number cannot be the 
zero of a polynomial with integer coefficients. Because y is a solution to 


z?—2—1=0, (2.36) 


as discussed at (1.1), it is therefore not transcendental. It can be shown 
that 7 cannot never be the solution to an equation of the form (2.36), 
where the coefficients are integers, so 7 is transcendental. Such a proof 
is beyond the reach of this book, but with some clever integration, we 
can at least show that 7 is irrational. 
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We begin by evaluating 


: " EG. 


where f(a) is a function to be named later, using integration by parts 
with 


u= f(z) so du= f'(x) dz, 
dv=sinxdx so v =-—cos@. 


This yields 


i f(x) sina dx = | - fe ) cos 2| ref f(a) cosa dz. 


The first expression on the right simplifies to f(a) + f(0). The integral 
on the right, with 


u= f'(2x) so du= f" (x) dz, 
dv=cosxdx so vu =sing, 
may be written 


[ f'(x) cosx dx = lf") sina] h-£ f’ (x) sina da. 


This time, the first expression on the right simplifies to zero. Now all of 
these pieces together give us 


f(x) sina dx = f(r) + f(0) -f ff (e\sin ede: (2,37) 
0 0 


This last integral matches the original (on the left) with f” in the role of 
f, so we can predict that 


[reo )sina da = f(r) + f"(0) )- fre) )sin a da 


and so on. Thus from (2.37) and this prediction we conclude that 
f s(osineae = [f2) - £70) + FG) -— FO) +] 
0 


+ | F(0) — F"(0) + fO(0) — F(0) +++]. 2.38) 
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If we choose f(2:) to be a polynomial of degree m, then every derivative 
after f( in each bracket will equal zero, and (2.38) will become 


[ fe)sinzde = [f() - £0) + 1) = EF) 
0 


t (FO) — FO) + £90) —-- £ FO]. 239) 


With (2.39) established, we begin a proof by contradiction by assuming 
that 7 is rational, so 


oe 
qd 


for some natural numbers p and q. Now the heart of this proof hinges on 
the discovery of a polynomial f(x) of degree m such that 


(1) For every m, every term on the right of (2.39) is an integer, yet 


(2) For some m, the integral on the left of (2.39) is not an integer. 


The polynomial 

f(x) = an (2.40) 
(where n is a natural number) does the trick. If we were to expand the 
numerator, we would find that the degree of each term ranges from x” 
to 2”. This has two!?®! implications: the first n — 1 derivatives of f(a) 
equal zero when evaluated at x = 0, and the denominator n! will be 
fully canceled as of the nth derivative. So, because p and q are natural 


numbers, then 
f (0) will be an integer for k = n,n +1,...,2n. 


Every term within the second pair of brackets in (2.39) is therefore an 
integer. To show that all terms within the first pair of brackets are also 
integers, we appeal to the symmetry within f(a) by noting!*! that 


f((p/q) — 2) = f(z). 


That is, the polynomial f(x) is identical to the polynomial 
f((p/q) — x), and so their respective derivatives match as well. Because 
f (0) is an integer (as we have just seen), then 


f((p/a) — 0) = f (v/a) = f(x) 


is an integer (and the same holds for their derivatives). These observa- 
tions establish the first of our two claims. 


52 2a 


Next (and finally) we wish to prove the second claim: that for some 
m, the definite integral on the left of (2.39) is not an integer. With our 
polynomial (2.40) replacing f(a), the integral becomes 


| a ea (2.41) 
0 


n! 


and (as previously noted) the degree m of the polynomial is 2n. Let’s 
maximize each of the three factors in the numerator 


x” (p— qx)” sina 


one at a time on the interval x € (0,7). We know that 0 < sing < 1 
when x € (0,7), and clearly the factor x” is maximized at x = 7, so 


x" (p— qx)" sing < 1" (p— qa)”. 


The curve p — qx is a straight line with y-intercept p and negative slope, 
so its maximum equals p on our interval. Thus, 


™"(p— qa)” <1"p” = (mp)”. 
We now have an upper bound for the numerator in the integral (2.41). All 


three factors in the numerator of the integral are positive for x € (0, 7). 
We can conclude, therefore, that 


0< [ ea Na dz < a kept dx a= PE dx. 
0 n! 9 ni 9 ni 
(2.42) 
The integrand p” /n! is positive and, for a suitably large value of n, it can 
be made!*°l as small as we wish. In particular, we will choose n so that 
p” /n! is less than 1/7"** so that the last expression in (2.42) is less than 
1, and therefore not an integer. The second claim is thereby proved. 

So, under the assumption that 7 = p/q we showed that there is a 
polynomial f(a) that creates a contradiction in (2.39). That is, there ex- 
ists an n so that the left side is not an integer while the right side is. 
Our assumption that 7 is rational must therefore be incorrect, and we are 
finished. 


2.11 Further content 


2.1 A geometric derivation of a formula for 7. Machin’s formula 


“ = 4arctan (1/5) — arctan (1/239) (2.43) 
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Figure 2.4 


was verified in section 2.3 with the help of a trigonometric identity. 
We can prove the similar formula 


“ = arctan (1/2) + arctan (1/3) (2.44) 


using just basic geometry. Triangle ABC in Figure 2.4 is an isosce- 
les right triangle with legs of length 4 (so by the Pythagorean theo- 
rem, the hypontenuse AB has length 4\/2). Point D bisects leg CB, 
and segment DE is perpendicular to AB. 


(a) Because 
CD 2 


omer 
we have ZC’AD = arctan (1/2). By finding the lengths of DE 
and AF, show that 


tan ZCAD = 


ZDAB = arctan (1/3). (2.45) 


(b) The results in the previous question are enough to establish for- 
mula (2.44). As we did in section 2.3, use the first three terms of 
the series (2.7) for arctangent to approximate 77. 

(c) Formula (2.43) approximates 7 more quickly than (2.44) does 
because of its larger denominators. So let’s improve on (2.44) 


by embellishing Figure 2.4 a bit. Locate point F' on C’B so that 
ZCAF = ZDAB. Now use (2.45) to argue that 
4 2 
FC= 3° andthus FD= re 
(d) Use the Pythagorean theorem to show that AD = 2/5. 
(e) Locate point G on AD so that FG is perpendicular to AD. Be- 
cause AAC D and AF'GD each contain a right angle and also 
share ZADC, we know that they are similar. Use this to estab- 


lish that : r 
GD = —= and FG=—=H. 2.46 
3V5 3V5 oe 
(f) From the figure it is clear that 
FG 
tanZFAD = —. 
= AG 


Use (2.46) to show that this ratio is 1/7. 
(g) So we have shown that 


7 = 2 arctan (1/3) + arctan (1/7). 


Use this formula, and the first three terms of the series for arct- 
angent, to approximate 7. 


2.2 Wallis’s formula for z/2 (via algebra). Section 2.4 presented a 
proof of Wallis’s formula 
2.2 4. 4 
i x=xX=.=xX RUT (2.47) 
2 1 3 3 5 5 7 
that depended heavily on calculus. The alternative argument outlined 


here requires only algebra. 


Let so = 0, 51 = 1 and, for n > 2, define 
3. OC«*dés 2n—-—1 


Sn = =X —X+++&X } 
2 =A 2n — 2 


(a) Let ey = 1 and, for n > 2, let e,, denote the product of the first 
2n — 2 factors of (2.47). (The letter e stands for even.) So, 


ey = 1, 
4 f 2 2 
=x rom =X = 
28 i 3) 
64 2 2 4 +4 
eg = — from -xX=x-=xX2c, 
45 1 3 3 =5 
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(b) 


(c) 


(d) 


(e 


wm 


(f) 


and so on. Write a similar definition for o,, (o for odd) and cal- 
culate 01, 02, 03. 
Explain why 


ey < €2 < €3 S++ SM En SK On S++ S03 < 02 < O1 


and justify the identities 


2n-—1 d 2n 
€n = ——> and on, = ——,. 
(Sn)? (Sn)? 
From the results gathered so far, derive the inequality 
2i—1 2i 
OT <(s,)?<— forl<i<n. (2.48) 
En n 
Now we appeal to geometry. Figure 2.5 shows the first quad- 
rant of an xy-plane partitioned into rectangles R;,; with lower 
left corner (s;,s;) and upper right corner (s;41,5;41). Let P, 
denote the polygonal region consisting of rectangles R;; with 
i+ 7 <n. The corners of P,, farthest from the origin are the 
points (s;,s;) with i+ 7 = n+ 1. Use the Pythagorean theo- 
rem and (2.48) to explain why the maximum distance from these 
corners to the origin is no more than 


2(n + 1) 
On 


(2.49) 


Then explain why the maximum distance from the inner corners 
of P,, (where 2 + 7 = n) is no less than 


2(n — 1) 


En 


(2.50) 


By the previous part, we know that P,, is contained in a quarter 
circle of radius (2.49) and that P,, contains a quarter circle of 
radius (2.50). Show how this sentence translates with the help of 
algebra to 

—1 1 

(n—=1)a < area of P,, < (n+1)a ;, 

2€n 20n 
Now we will show that the area of polygonal region P,, simply 
equals n. The heights and widths of rectangles R;,; are differ- 
ences in consecutive s; values, so let’s define 


(2.51) 


An = 8n4+1 — Sn. 


Figure 2.5 
First, explain why 
j+l a+1 
jay = —— iaj41 + i410). 2.52 
ORS Qtech gy os pupae et F eo 


Then apply (2.52) to each term in 
ApAn + A1An—1 +°+*+ + AnQo 
to convert this expression to 
A0An+1 + {An +++ + An41Q0- 


Why does this equivalence imply that the area of P,, is n? (Hint: 
start with agag = 1.) 


(g) So we may substitute 7 for the area of P,, in (2.51). Explain why 


it follows that 


—1 +1 
SS ee ae a 


on on (2.53) 
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In your own words, how does taking the limit as n — 00 in each 
expression in (2.53) yield Wallis’s formula? 


2.3. Converting an infinite product to a continued fraction. In section 
2.6, we expressed a function involving sine as an infinite product. 
Here we discover that such products may be expressed as continued 
fractions. This in turn will unlock a lovely formula for 7 (in exercise 
4). 


An infinite product a1;a2a3a@4--- resists conversion because contin- 
ued fractions involve sums, so we introduce this operation by con- 
verting the equivalent infinite product 


(1 + b1)(1 + bz) (1 + b3)(1 + ba) ++ 


to a continued fraction. 


(a) Skipping to the punchline would obscure the process by which 
one might discover the desired conversion. So we begin slowly, 
with the guess 


b 
(1 +b;)(1+b2) =1+ (2.54) 
1+ = 


where y and z are to be determined. Show that from (2.54) we 


can derive 
by + by + by, b2 = z 
by By 
Now if we choose z = 0; +b2+1 bo, explain why y = —bo(1+ 
bi). 
(b) Substituting for y and z in (2.54) yields 
(1+b,)(1 +b) =1+ z (2.55) 
: Sal bo(1+bi) ” 


- by + bg + by bo 


To extend the result, observe that 


(1 + bi)(1 + b2)(1 + bs) = (1 + b1)(1 + (b2 + b3 + b2b3)) 


rewrites the triple product on the left as a double product that 
compares to the left of (2.55), with bz + bs + b2bs in the role of 


bg. Replacing bz with bz + bg + b2b3 throughout (2.55), we find 
that the bottom fraction becomes 
(bo + b3 + bobs)(1 + b1) 
by + (b2 + b3 + babs) + bi (be + bg + b2b3) 


Show that this expression is equivalent to 


bo(1 + b1) 
bib3(1+b2) ’ 
by + be + bib 
1 2 192 be Aebacetohs 


thus extending our result to the conversion of a triple product. 
Note. The trick used in part (b) extends naturally to a proper 
induction proof of the full result that 


(1 + b1)(1 + b2)(1 + b3)(1 + ba) ++ (2.56) 


can be written as the continued fraction 


bi 


1+ 
bo(1 + 61) 


by b3 (1 + be) 
beba(1 + bs) 
bs + ba + bgba — --- 


bi + b2 + bib 


bz + b3 + babs 
Exercise 4 puts this conversion formula to good use. 


2.4 A harmonic continued fraction for 7. Here we may use the tech- 
nique described in exercise 3 to transform 


SD x)(1+2)(1 =)(1+5)(1 ayo (2.57) 


TL 


(which we obtain from (2.22) by replacing x with 7x) into a contin- 
ued fraction for 7 that features a simple and beautiful sequence. 


From (2.57), we get 


eo (t-g)O+a)O-a)(+a)G-g)-~ es 


by the substitution x = 1/2. 
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(a) Comparing (2.58) with (2.56) from exercise 3, we see that 
b} =—1/2, bo =1/2, 63 =-—1/4, andso on. 


Substitute these values into the continued fraction version of 
(2.56) and reach 


a : (2.59) 
T 159 , 
oe 
28 
1+ 
3-4 
sk 
ieoee 


via some appropriate algebra. 
(b 


mM 


While the continued fraction (2.59) evidences a nice pattern, we 
can further tidy it up. Denote by Q the infinite tail end of (2.59), 
and observe that 

2 1 14+Q 

rT oO 2.0. 
Inverting both sides, we have 

ee a ey eee 

2 14+Q 1+Q 
With the help of these observations and a bit of simplifying, dis- 
cover the lovely continued fraction 


=14 (2.60) 
1+ 


1 fo: 
eee 
fe ee 


with the harmonic series as its prominent feature. 


2.5 Finding the nth digit of 7. As long as we are patient, we can pro- 
duce any number of digits of 7 using formulas like 


ae Lat ae and 
4 35 7 9 


“ = 4arctan (1/5) — arctan (1/239) 


60 


(Nilakantha’s (2.8) and Machin’s (2.10), respectively). Quite recently, 
a team of mathematicians discovered a way to leap over any number 
of digits of 7 and land where we wish. The only limitation is that we 
must work not in base ten but in a base (like binary or hexadecimal) 
that is a power of two. 


(a) We will get to 7 in a bit. First, by a similar but simpler process, 


(b 


wm 


we will calculate the nth binary digit of In 2. Integrate the series 


1 
——=l4+arta?+a2°4--- 
l-@ 


and substitute z = 1/2 into the result to prove that 


“1 1 1 1 
In2= = se. (2.61 
" ai, Tot) 2.92 T 3.g3 T ea) 
Expressed in binary, 
In 2 = 0.101100010111001000010111111101.... (2.62) 


Let d,, represent the nth digit of In 2 in binary. Suppose that we 
wish to calculate d3q without calculating the preceding 29 digits. 
Note that multiplying (2.62) by 2?° yields 


2° In 2 = 10110001011100100001011111110.1... , 


having the effect of shifting the decimal 29 places to the right, 
leaving dgo as the digit directly after the decimal. Thus, if we 
can calculate the fractional part of 279 In 2, then d3o will be its 
first digit. 

So we multiply (2.61) by 279, simplify each term, and do a little 
grouping, to get 


oo ne salir ley iar ee 
scale (iy Maa ie a 27 ' 28 
a ae xa : + Ps (2.63) 
96, FAO. BP a4! 98280." : 


We want to determine the first digit after the decimal of the ex- 
pression on the right. The fractions in the second group are easy 
to translate to binary via long division, and they converge to 0 
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quite quickly, so we would stop when it is clear that they will no 
longer affect the value of dgo. 

The fractions in the first group are trickier. The first two sim- 
plify to integers, so they have no bearing on the value of d3o 
(because it is the first digit after the decimal). But it is not imme- 
diately clear if 27°/3 is an integer. Further, if it has a fractional 
part, it will be 1/3 or 2/3, but which? And while 27° has just 
eight digits, and is therefore accessible, what if we wanted the 
quadrillionth digit of In 2 instead of the 30th? 

Modular arithmetic comes to our rescue. Two true equivalences 
multiply to yield a third: 


a=bmodm 
= ac= bdmodm. 
c=dmodm 


To calculate 27° mod 3, we find certain powers of 2 modulo 3: 


2) =2 mod 3 
2? =1 mod 3 — 
2 =1 mod 3 
2® =1 mod 3+ 


2'6 =] mod3¢ 


by starting with the obvious top equivalence, then squaring each 
side (and reducing). Because 26 = 16 + 8 + 2, we multiply the 
equivalences marked with arrows to reach 


978 = T-mod 3; 


which tells us that the fractional part of 27°/3 is 1/3. Because 
only the fractional part can affect dgo, we can ignore the integer 
part of 27° /3 (whatever it is). In this way, we can march through 
the terms that have powers of 2 in their numerators, reducing 
each one to a fraction. 


Here are the partial results of applying this process to the frac- 
tions in (2.63). The — symbol indicates that we are keeping only 


62 


wm 


the fractional part. 


278/130 
27" /2 +0 
26 /3 + 1/3 
27° /4 + 0 
274/15 + 1/5 
273 /6 + 2/6 


2°? /7 4 2/7 


2? /27 > 4/27 


21/28 + 2/28 


Verify that each of these results is correct. 


Adding together these fractions (as well as a few of the fractions 
in (2.63), starting with 1/29) and converting to binary yields 


1010.1111100011..., 


and we can ignore everything except the first digit after the dec- 
imal, which establishes that dg) = 1. 

Now that we know how to find the nth digit of In 2 by first ex- 
pressing it as a sum of fractions, we can look for a way to express 
m as a similar sum. This task was completed in 1996 with the aid 
of a computer program, resulting in 


e 1 4 2 1 1 
— é 
2k \8k+1 8kK+4 8kK+5 8k+6 


(2.64) 
The terms in this sum differ a bit from those in (2.61), but only 
in that the factor & in the denominators has been replaced by 
polynomials in terms of k. This has no effect on the process 
described above. 
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We can verify (2.64) using techniques from elementary calculus. 
To begin, show that 


{Gaeta 
=T 
o wee. @=Deiy 


using (and these are merely suggestions) substitution and trigono- 
metric substitution. 


(d 


wm 


Combine the terms in the integrand and multiply the resulting 
numerator and denominator by 


(y? + 2)(y* + 2y + 2) 


to reach 


y=. 


i 16(y° + y* + 2? 4) 
0 y® — 16 


(e) Substitute y = 2x (and so dy = 2 dz) to find that 


c=. (2.65) 


i V2 4,/3 — 823 — 4/924 — 8a° j 
0 


1— 28 


(f) To link (2.65) to (2.64), we show that for k > 1 we have 


He gt = 


Establish this result using geometric series. 


Ww 


Finally, prove (2.64) by applying (2.66) to each of the four terms 
of (2.65). 


(g 


Postscript. Using (2.64) to calculate digits of 7 is best done with 
computer help, so we will not do so here. But we pause to note, 
with satisfaction, that although the birth of elementary calculus 
is several hundred years in the past, and the study of the constant 
am is even more ancient, the discovery outlined in this exercise 
eluded us until not long ago. We now know, for example, the 
binary digit of 7 that sits in position 2 x 10/°. 


CHAPTER 3 


e 


As a sociologist, my dad was familiar with studies that probe how the 
human mind works, and he occasionally performed similar experiments 
on my sister and me. For a Christmas gift, he gave each of us a two-pound 
chocolate kiss and a challenge: if either of us could eat the whole thing 
that day, he would reward us with another one. (This inverts the Stanford 
marshmallow experiment in which children are given a marshmallow and 
rewarded with another if they do not immediately eat the first.) I finally 
gave up gnawing on mine when it was a tooth-scarred ping-pong ball. 
My sister didn’t try. 

Dad also gave me an envelope containing a one dollar bill under the 
condition that I gift it to a total stranger with no explanation. (I still have 
it.) He once presented me a trade of all the money in his wallet for my 
next five weeks’ allowance of five dollars. Conservative and suspicious, 
I opted not to play. (His wallet had over thirty dollars in it.) 


3.1 The money puzzle 


Many of my dad’s challenges were puzzles, and one in particular takes 
us to our next constant. He offered me ten dollars or the option to split 
the ten and multiply the parts. Like five times five is twenty-five, I asked, 
and I get that much? Yup. So I went to work — two times eight is sixteen, 
three times seven is twenty-one — soon deciding that my original guess 
was best. But then inspiration struck: what about three times three times 
three times one is twenty-seven? Can I have that? Caught between the 
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wish to reward my initiative and the fear of the unknown, my dad decided 
to risk it, allowing more than two parts. 

The wasted 1 in 3 x 3 x 3 x 1 = 27 bothered me until I overruled my 
desire to make the parts equal and lumped it in with one of the threes, 
obtaining 3 x 3 x 4 = 36. My dad gulped. 

It wasn’t long before I hit on 2.5 x 2.5 x 2.5 x 2.5 = 39.06, prompting 
an emergency ruling. That’s it, my dad said. I thought the answer was 
twenty-five. Stop. (But he needn’t have panicked, because the maximum 
possible product is less than a dollar more.) Our search for that maximum 
product will lead us to our next constant. 

A bit of experimentation would persuade anyone that the product of 
the parts of the ten dollars will be greatest if the parts are equal (and, 
in fact, the arithmetic geometric mean inequality (1.17) assures us?" of 
this). If 2 is the amount in each part, then «!°/* is the product of the 
parts, so I was searching for the value of «x that would maximize x!°/”, 
For example, when I split 10 into four equal parts of 2.5 each, the yield 
was 2.519/2-5 — 2.54 = 39.06. While as a child I would have only 
considered splitting the ten dollars into a whole number of parts, let’s 
relax that stipulation and allow such products as 


2.610/2-6 ~ 2.63-85 ~ 39.45, 


which beats our current record. 
A bit of exploring (as in Table 3.1) suggests that x 
imum at a value near x = 2.7. In the next few sections, we will capture 


10/ reaches a max- 


2519/25 =~ 39.06 
2.619/2-6 ~ 39.45 
2.710/2.7 ~ 39.60 
2819/28 ~ 39.54 
2910/29 ~ 39.31 


Table 3.1 


the constant that would have maximized my dad’s payout. Leonhard Eu- 
ler generally receives credit for being the first person to incisively explore 
the properties and importance of the constant. It was he who chose the 
letter e to denote it. Although others, such as John Napier and Gottfried 
Leibniz, had knowledge of e before Euler, it was Euler’s investigations 
that provided the deepest insights. 
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3.2 Euler’s e = )°1/k! 


Living before the advent of limits, Euler calculated with “infinitely small” 
numbers (or “infinitesimals”), that is, numbers that (metaphorically) are 
infinitely close to zero without equaling zero. His search for an infinite 
series involving e began, in his words, “Since a? = 1, when the expo- 
nent on a increases, the power itself increases, provided a is greater than 
1. It follows that if the exponent is infinitely small and positive, then the 
power also exceeds 1 by an infinitely small number. Let w be an infinitely 
small number, or a fraction so small that, although not equal to zero, still 
a” = 1+, where w is also an infinitely small number.” 

Examples may help make Euler’s intent clear. Suppose that we choose 
a = 2andw = 0.001. Then 


29-091 — 10006933874... andso ~ = 0.0006933874... 


and we see that 7 is a bit less than w and that both are close to zero. 
Neither is, of course, infinitely small, but together they illustrate Euler’s 
point. If we change a to 3 and keep w = 0.001, we have 


3°-001 — 10010992159... andso w= 0.0010992159... 


instead, and we note that w > w in the first case while w < w in the 
second. Perhaps there is a number a between 2 and 3 for which w = w? 
Testing the special number 2.7 from section 3.1, we find that 


2.7°-001 — 1.0009937452... andso 7 = 0.0009937452... 


and we have our closest match between w and w yet. It seems that pur- 
suing the constant in Euler’s context will also answer my dad’s money 
puzzle. 

So we are narrowing in on the particular number a that forces w to 
equal w, so that 


a’ =1+w. 


To this end, let n be an infinitely large number (in the same poetic way 
that w is infinitely small) such that w = 1/n, and rewrite the equation as 


1 Ly" 
a/™ =14+— or w= (142) : (3.1) 
n 
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Now the generalized binomial theorem (2.23) allows us to expand the 
right side to obtain 


ie 1 ‘an n—-1 iy" 
= . . . 
1 n 1 2 n 


1 n—-1 Doggie n—-2 ee 
1! n 2! n n 3! 


as long as we accept that the theorem applies to exponents that are in- 
finitely large. Dividing every term in each fraction (that contains n) by 
n, we obtain 

Lyla th, ob 3 Tain) te aya) 4 


T T 


a=14 
1! 1 2! 1 1 3! 


Because n is infinitely large, all of the fractions within parentheses van- 
ish. Thus we have 
1 1 1 


ull 
SE ay tap thet 2 or a=) a: (3.2) 
k=0 


As Euler did, we will let e denote this constant: 
e & 2.71828. 


Putting (3.2) together with (3.1), and using modern notation, we arrive at 
two beautiful truths about the constant e: 


1\" coef 
e= Jim (1+ =) and eS a (3.3) 


Euler’s argument fails at times to be rigorous in the modern sense. How- 
ever, with the proper use of inequalities and limits, we can prove that 
these conclusions are correct. For example, we will see in section 3.4 
that a judicious use of the arithmetic geometric mean inequality proves 
that the limit on the left of (3.3) exists. 

Before we move on, let’s investigate the expression 


Ce.) 
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where x is a constant. This expression is quite like the one on the right in 

(3.1), and in fact we will manipulate this new expression much as Euler 

did his. Using the generalized binomial theorem to expand, we find that 
NM nae nn—-1 «2 nn-1 n-2 2x 

(1 + =) =14 . f . . f . . ; ae 
n ln 1 2 nn? 1 2 3. Oon3 


and, as before, we can manipulate!*”! this equation to the form 


lim a2) ieee ea: 
n—00 n 2! 3! 
If we set e” equal to this limit, so that 
e = lim (1+2)"=1t2+ 5+ 54 , (A) 
then with « = 1 we recapture both formulas in (3.3). These sorts of 


observations prompted Euler to declare (3.4) as the definition of e”. 
A final note: because the sum on the right side of (3.4) appears to 
equal its own derivative, we may suspect that 


The endnotes!*3! contain a proper proof of this well-known fact. 


3.3. The maximum of x!/“ 


Armed with (3.4), we return to the problem (from section 3.1) of max- 
imizing x!°/* (and thus maximizing the amount of money that my dad 
would owe me). Writing a!°/* as (a1/*)1° allows us to ignore the 10 for 
now and concentrate on maximizing the function x!/*. Now finding the 
maximum of a function typically involves its derivative. Here we take a 
different approach. 

We need to establish two facts before we maximize x!/* directly. The 
first fact carries the name Bernoulli’s inequality, after Jacob Bernoulli, 
who we met in section 2.9. 


Claim. If m is a natural number and y > —1, then 


(l+y)" >1+my. (3.5) 


For example, when m = 1 we have the obviously true statement 


l+y2l1t+y 
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and when ™ = 2 we observe that 
(l+y)? =14+2y+y? >142y (3.6) 


because y? > 0 (not just when y > —1, as the claim specifies, but for all 
values). So (3.6) says that 


(1+ y)? >1+2y 
and if we multiply by (1 + y) we get 
(1+y)*> (1+ 2y)(1+y) (3.7) 


as long as (1 + y) > 0 (that is, as long as y > —1). The claim therefore 
contains this stipulation. So now from (3.7) we continue: 


(1+y)* > 1+ 3y 4 2y? 


21+ 3y 
and we see that the claim holds for m = 3. A proper induction proof of 
the claim is included!**! in the endnotes. 
Our second task (before we maximize gi/ *’) begins with the limit 


e* = lim (1+ =\" 
n 


n—->co 

from (3.4). We wish to show that if we choose x > —1 then the expres- 
sion (inside the limit on the right) is monotone increasing, which is to say 
that as we let n increase, every term is at least as large as the previous 
one. For example, letting x = 1 (in the left column) and x = 2 (in the 
right) and taking n to be the first few natural numbers, we generate the 
approximations in Table 3.2, and see that they each form an increasing 
sequence. We can formalize our goal by proving the following claim. 


Claim. With x > —1 and na natural number, we have 
a nt+l1 a\n 
(1 + ) > (1+) 
n+l n 


The claim states that by bumping n up to n + 1, we get a new value that 
is at least as large as the previous value, just as Table 3.2 suggests. Note 
that the expression in the claim is equivalent to 


i n+1 
(1+ =) 
ee, 
7 > 
(oe 
n 


(3.8) 
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BR 


(l+q) = (l+q) = 
(144)? = 2.25 (1+ 2)? = 
(1+ 4)° ~ 2.370 (1+ 2)° = 4.630 
(1+4)'x 2441  — (142)* ~ 5.063 
(144)? ~ 2488 = (14+. 2)° = 5.378 
(1+ 2)° = 2.522 (1+ 2)° ~ 5.619 


Table 3.2: Some approximations that suggest that (1 + 2/n)” is monotone 
increasing. 


and we will prove this alternate inequality. We begin with the left side. 
The first few steps are merely algebraic manipulation: 


n(n+1)+ an ae 
n(n+1)+a(n+4+1) 


walt eh . 
lec 
ae 


n?+(a+1)n+a-—4 aa 
(n+1)(n+2) 


The quirky decision (in the last step above) to interject +x — x into the 
numerator allows us to factor n? + (a + 1)n + a as (n+ 1)(n +2) and 
therefore write 


(1+ a)" ; a 2 Gee 


(n+1)(n+2 
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Now compare the second factor 


(: aera) 


with (1 + y)” from the claim (3.5). If we put n + 1 in the role of m and 


(n+ D(n+a) 


in the role of y, then we can use claim (3.5) to proceed. Note, however, 
that (3.5) insists that y > —1, so we must check that 


(n+ Din+a) ~ 


(3.10) 


before we use it. Recall (from the current claim) that x > —1 and nisa 
natural number, so n > 1. Thus the denominator of (3.10) is positive, so 
we can rewrite it as 


g<(n+1)\(n+2) or O<n?4+n(xe+1), 


which is clearly true. So we may use claim (3.5) to write (3.9) as 


n+1 
x 
14+ —— 
( =) 


(1+=)" a) (-(@40- mae) 


It is easy to verify that the right side simplifies to 1 and thus that we have 
proved the claim in its alternate form (3.8). 

Armed with these two claims, we may finally show how to maximize 
1/© without using its derivative. At (3.4) we showed that 


the curve x 


oS lin (1+ =\" 
n 


n—->co 


and we have just proved that the expression inside the limit on the right 
is monotone increasing as long as x > —1 and n is a natural number. 
Thus, 
x n 
ev > (145) when x > —landn > 1. 
n 


Now we use claim (3.5), with 2/n in the role of y and n in the role of m 
to extend this inequality and say that 


% GN x 
e > (1+ =) >lt+n--—=1+2 whenze> -l. 
n n 
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So (changing the variable from x to y for a moment) we have established 
that 
ey >1+y wheny> -t. 


Recall that our original goal was to maximize x!/* when x > 0. Ob- 
serve!?>! that 


zr>0 7 


so we may let 


in (3.3) and conclude that 


gees ee, 


Because this inequality simplifies!>*! to 
el/e = gee 


we learn that x!/* reaches its maximum when x = e. So, at worst, my 


dad would have had to pay me (e!/¢)!° = 39.60 dollars. 


3.4 The limit of (1 + 1)" 


As promised in section 3.2, we will now see an example of how we may 
rigorously prove one of the limits that Euler discovered (by ingenious but 
mathematically shaky methods). Specifically, we will prove that the limit 


lim (1 4 *) (3.11) 
noo nr 


from (3.3) exists. Our primary prop will be the arithmetic geometric 
mean inequality. Let’s start by defining a sequence {y,,} (where n is 


a natural number) as 
1 n 
n 


and a companion sequence {z,, } as 


1 n+1 
a= (1 + =| : 
n 


Table 3.3 shows each sequence for the first few n values. Our proof that 
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BR 


(i++) = 20 (1+4) = 40 

(142)? = 225 (1+3)° = 3.375 

(1+ 4)? «237037 = (1+. 4)* = 3.16049 

(14+4)* ~ 2.44141 (1+ 4)” = 3.05176 

(1+2)° ~ 2.48832 (1+ #)° = 2.98598 

(14+ 2)° ~ 2.52163 1+)" = 2.94190 
Table 3.3 


limit (3.11) exists is in four acts: 


(1) Show that y, < z,, for every n. 
(2) Show that {y,,} is monotone increasing. 
(3) Show that {z,,} is monotone decreasing. 


(4) Show that {y,,} and {z,,} share a common limit. 


These observations appear to be plausible when one reads Table 3.3. We 
can prove the first using simple algebra. 


Claim. We have y,, < 2», for every n. 


Because 7 is a natural number, we know that 


1 
(1++) >1. 
n 


This observation explains the inequality step in what follows: 


ik ees ie 1 ie 
Blea) Se) Cra) Ces) am 
n n n n 
And so Yn, < Zn as desired. 


The second and third claims follow from the arithmetic geometric mean 
inequality. As discussed in section 1.7, it states that for any set of non- 
negative numbers a1, d2,...,@n, their geometric mean 


Gn = (a1a2 aes iene 


is less than or equal to their arithmetic mean 


a1 + a2 +++++4n 


9 


An = 


n 
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so 
a1 + a2 ++*' + Gn 


n 
with equality only when all of the a; are equal. Now we will use this 
inequality to prove the second claim. 


(ayaz-++an)'/" < ; (3.12) 


Claim. The sequence {y,,} is monotone increasing. 


Let’s warm up to this task by checking a couple of examples. We can 
directly calculate that 


ai 
= (1 + :) =o, (3.13) 


1\2 3\2 
i ee en ee ee 
ea a) 


so yi < Ya. But consider this unusual argument as an alternative: let 
m = 2 in (3.12) and choose a; = 1 and ag = 2 as our two numbers. 
Substituting into (3.12), we get 


ee 
(1-2)? < = 


with strict inequality because a; # ag. Squaring both sides yields 


3 2 1 1 1 2 
2< (3) or (1++) <(1+5) ; 


which matches the direct calculation in (3.13). So we have verified our 
direct calculation with a clever use of the arithmetic geometric mean 
inequality. We will generalize this clever trick after we verify (just for 
practice) that yo < y3. Let n = 3 in (3.12) and choose a; = 1 and 
a2 = a3 = 3/2 as our three numbers. Substituting into (3.12), we get 


2 3 
(1-3/2-3/2)¥8 < +82 +32 or (1+5) < (1+) 


SO Yo < yz as desired. 
Now we generalize: let’s choose the n numbers 


aj=1 and ag=a3=-::=Qy 
Substituting into (3.12) yields 
(1 n n re Ll+n/(n—1)+---+n/(n-1) 


n 
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There are n — 1 copies of n/(n — 1) on each side, so we may simplify to 


( n es n+1 
< F 


n—-1 n 


Raising both sides (of the second inequality) to the power n and doing a 
little algebra, we find that 


1 n-1 1 n 
(1+) < (1+) | 
n—-1 n 


SO Yn—1 < Yn. In other words, we now know that the entire sequence 
{yy} is monotone increasing. 


It may come as no surprise that we can prove the third claim in much the 
same way. 


Claim. The sequence {z,,} is monotone decreasing. 


As before, we will warm up with an example, say 23 < 22. Letn = 4in 
(3.12) and choose a; = 1 and az = a3 = a4 = 2/3 as our four numbers. 
Substituting into (3.12), we get 


14+2/34+2/3+4+2 a\ 3/4 
1/4 + /3+ /3+ /3 =e (3) = 


(1-2/3 2/3 - 2/3) ; 5 


Raising both sides to the power 4 and then inverting each fraction (which 
changes the inequality sign), we can simplify this to 


Ps? i> 
Aa ias2 
(+3) <(+5) | 


which says z3 < 22 as we wished. To generalize, choose the n numbers 


n—-2 


aj=1 and ag=a3=-:::=Qyn , 
n—-1 


Substituting into (3.12) and simplifying, we obtain 


a n—1 
< F 
n—-1 n 


Raising both sides to the power n and then inverting each fraction, we 


reach ‘ 
1+ 4 - 1+ d a 
n—1 n—2 : 
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SO Zn-1 < 2n—2. So we now know that the entire sequence {z,,} is 
monotone decreasing. 


To establish the final claim, we simply note that z, = (14+ 1/n)yn 


implies that 
1 n 1 n+1 
lim (1 + ) = lim (1 + ) ; 
noo n n+ co n 


and of course we let e denote this common limit. Because the sequences 
in these limits are monotone increasing and monotone decreasing, re- 
spectively, we can conclude that 


1 n 1 n+1 
(1+) <e< (1+) (3.14) 
n n 


for any natural number n, a fact that we will return to in exercise 4 when 
we talk about Euler’s constant. 


3.5 A modern proof that e = )°1/k! 


Here is a proof of the sum in (3.3) that uses integration by parts, a tech- 
nique that relies on limits, not infinitesimals, for its reliability. Consider 
the family of integrals 


t= xe “dx (3.15) 
where 7 is a natural number. The definite integral equals the area over 
the interval x € (0, 1) under the curve 


URS are 
n! 


The subscript in y, indicates that the curve depends on our choice of 
natural number n. Plots of the first few members of this family of curves 
suggest!>”! that the area bounded by the curves approaches zero as n 
grows larger. That is, we think that 


1p 
lim a, = lim =f ze “dx =0 (3.16) 
0 


n—0o noo n! 
is a Safe bet. We can prove (3.16) by noting a few characteristics of the 
curve y,,. At the left boundary x = 0, we have 


1 n ,—0 
yn(0) = 0 e -=0, 
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while at the right boundary x = 1 we have 


Yn(1) = ie = — so jim, Yn(1) = 0. 
So every curve in the family passes through the origin (0,0) on the left 
and a point approaching (1,0) on the right. Now if every curve in the 
family lacks a critical point in the interval (0, 1), then we know that each 
curve has no maximum points on the interval, so it must increase across 
the entire interval. To check for critical points, we set the derivative 


equal to zero, and conclude!**! that the curve has no critical points in the 
interval (0, 1). These observations together prove (3.16). 
We now turn to the family of integrals (3.15). Starting with n = 0, 
we calculate 
1 


ago = > 
0! Jo 


1 
ve dr = | e “dz = [-e-*], =l1--. 
0 


e€ 


When n = 1, we use integration by parts!>*! to find that 


1 a 1 

ay=——+ e “dx or ay=——+4. 
e 0 e€ 

Note that a; can be quickly found because we already calculated aj. We 

encounter this same shortcut when we integrate the general case. Using 

integration by parts with 


u= x” so du=nzx" 1 dz, 
dvu=e "dx so v =-e”, 
we find that 
1 . er 
an = a awe "dx 
n! Jo 
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The second term is a,_1, So we have established the recursion 


1 
An = —=— + Gn-1 = Gn-1 - —— - 
nle nle 
Starting with ag and using it, we conclude that 
1 
ag = 1 = 
e 
1 1 1 1 
a=a = 
‘ oie e le’ 
1 1 1 1 1 
Be Oh Be e lle le?’ 
and so on. In general, 
1 1 1 1 i 
or e lle 2e nle 
1 1 1 1 
=1 14 peat : 
e ( Tay uals =) 


Taking limits, we have 


F ee fat aay 
se = ae a oe aE 


By (3.16), the left side equals zero, and then solving for e yields 


= S- - (3.17) 


Thus ends our modern proof of this sum, verifying that Euler’s argument 
(in section 3.2) does indeed lead to a reliable conclusion. 


3.6 e is irrational 


It is certainly possible for an infinite series of rational numbers to equal 
a rational number; for example, the geometric series 
Lee vd 1 


es pear gee ge ree dl 
tat ep hag * (3.18) 
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sums via the formula 


where a is the initial term 1 and r is the common ratio 1/3. So the series 


(3.18) sums to 
1 3 


Lae 


a rational number. 

This is not so for the series (3.17) for e, a fact that we will demon- 
strate now, using a proof by contradiction. Suppose that e is a rational 
number, so set e = p/q. We split'*°! the series (3.17) after the term 1/<q!: 


Replacing e with p/q, and subtracting the first sum from both sides, we 
have 


If we multiply the left side by gq!, then every term simplifies to an integer, 
so the entire left side equals some integer. On the other hand, the right 
side’s terms fall between 0 and 1 when multiplied by qg! because 


Co 


q! 1 1 1 


Mo (q+l) @t+a@t2) @thatDa@t3 


1 1 1 


| | 
T 


< ae ie 
Qae Tete es lg sel)? 


a geometric series with a = 1/(q¢+ 1) = r that sums!!! to 1/q. Thus, 
the right side cannot equal an integer. This contradiction establishes that 
e cannot be rational. 


3.7 Stirling’s formula 


Formula (2.19) gives a good approximation for a specific question (what 
is the probability of flipping exactly n heads and n tails in 2n coin flips?). 
Even a slight tweak to this question forces an entirely new approach to 
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such an approximation. For example, the probability of flipping 40 heads 


in 100 coin flips is 
100 
40 100! 


9100 40! 60! 2100 © 


These factorials are murderous to find by hand. 

James Stirling (Scotland, born 1692) found an approximation to n! 
that, happily, involves both 7 and e. Our presentation uses a modern, 
calculus-based approach; Stirling’s original discovery did not. 

The addition property of logarithms makes it convenient to begin not 
with n! itself but with ln n!, because 


Inn! =Inl4+In2+---+lInn. (3.19) 


This sum is very nearly"?! the area defined by 


ip Ina dz, 
1 


which via integration by parts with 
u=Inx so du=(l1/zx)dz, 
dv = dx so Uv =2, 


gives us 


/ Ine dx = [ang —a]) =ninn—n+1. (3.20) 
1 

We can approximate this same area using trapezoids as in Figure 3.1. Let 
R,, Ro,...Ry—1 denote the areas of the “remainders” under the curve 
y = Ina that are not occupied by the trapezoids formed by the line 
segments shown. Then 


n—-1 


| Ing dz = S- areas of trapezoids + S- Rg. (3.21) 
i k=1 


First we will calculate the areas of the trapezoids, and then we will show 
that the combined areas of the remainders is finite even if we let n grow 
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A 
Ry. 
vE yr Lule) 
| R, or aaa 
| Pee 
} Ri # ; 
' Ve. i 
i Lt ‘a te 

| \ =: ; 
} + > val 
i 

Figure 3.1 


arbitrarily large. Recall that In 1 = 0, so 
Inl+m2_ n2+In3 


f t ids = 
S- areas of trapezoids 5 5 
In(n—1)+I1nn 
2 
=In2+1n34 tIn(n-1)+=Inn 
Comparing to (3.19), we see that 
1 
S- areas of trapezoids = Inn! — 5 Inn. (3.22) 


Using (3.20) to replace the left side of (3.21), and (3.22) to replace the 


first sum on the right of (3.21), we have 


n-1 

1 
nlan—n+1=Innl— ZInnt+ ) | Re. (3.23) 
k=1 


We must grapple with the sum of the remainders. First, we will establish 
that their combined area is limited even as n — ov, and then we will find 


the limit exactly. 


Claim 1. The combined area of the remainders is limited, that is, 


n—1 


lim Ss Ry exists. 


n—>0o 
k=1 
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The left drawing in Figure 3.2 shows the kth trapezoid ABC'D with the 
curve’s tangent line DE drawn to create trapezoid AB ED. The slope of 
DE is not only FE (because DF’ = 1), but also 


dy 1 
—Ing = 
dx ate es 


SO 


1 1 
Thus, 
: 1 1 
the area of trapezoid ABE. D = 5 nk tInk 4 | 
1 
=Ink+—. 
DEY OR 


In the right drawing of Figure 3.2, the tangent line at point C’ creates 
trapezoid ABCG. The slope of GC equals both 1/(k + 1) and GH, so 


1 1 
the area of trapezoid ABCG = 5 in (kK+1)4+In(k+1)- al 


1 
=In(k+1) - ——_.... 
ET) Fee) 
The arithmetic mean of the areas of trapezoids ABE'D and ABCG ex- 
ceeds the combined areas of trapezoid ABC'D and the remainder area 


Rx, SO 


1 1 1 1 
Ink-4 tIn(k+1 Ink +1 + 1)} 4 : 
jr tigg = seca] > gla me lee 
= 
i + see a ei 
a, 7 |¢ 


Oe] te A . - — jb ~ 
k k+l k k+l 


Figure 3.2 
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This simplifies nicely to 


1 1 


— — ——— > Rx. 
4k A(k+1)~°* 
The combined area of the remainders R,, Ro,...,Ry_1 thus is 
bounded: 
n-1 n-1 
1 1 1 1 
ye < do ( )=i-m 
ramet = 4k 4A(k+1) 4 An 
Therefore, 


n-1 
. 1 
ear 
k=1 
All R;, are positive, so their sum approaches a limit that we will call L. 


This establishes Claim 1. 


Claim 2. The limit LZ of the combined area of the remainders equals 


1—Inv 2a & 0.08106. 


Rearranging (3.23) yields 


n-1 
S> Re —1l=nlnn—n-Inn! +4 (1/2) Inn. 
k=1 


Taking the limit of both sides, we can write 
L-1xninn—n-—Inn!+4 (1/2) Inn. 


Exponentiating and simplifying!*!, we reach 


(3.24) 


Now if we can evaluate the limit on the right, then we can solve for L as 
desired. Letting S,, denote the expression on the right, so that 


lim S, = lim eva 


n—0o noo nlenr ’ 


note that 


n2nr n 


lim (S,,)? = lim 


n—0o noo n! n! e2” 
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bears some resemblance to (2.16) and (2.19), which combined give 
(Qn)! _ 1 
nin! 222 ~~. /e(n + 1/2) 


: (3.25) 


We can make the resemblance stronger by noticing that 


(2n)?”/2n 


me enh 
and so the ratio of (Si,)? to Sap, is 
(Sale pee __(2n)! e2n 
Son ninte2”” (2n)2"/2n 
(2n)! n 


~ nln! 222° Jan’ 
Taking the limit of both sides and using (3.25), we find!) that 
(Sy? : 1 n 1 


lim a 


1m . — “s 
noo Son noc y/r(n+1/2) V2n V2a 


The limit on the left side can be simplified!'*>! another way: 


2 
2 lim (S,)? (lim Sn) 
lim (Sn) SS = oes = lim Sy. 
nco Son lim Son lim Sj, n-00 
noo noo 


These last two results imply that 
1 


lim S, = —. 


noo V/2 


Now we can use (3.24) to calculate!*® that, as claimed, 
L=1-Inv2n. 


More importantly, however, we have from (3.24) that 


or 


nlx Vann (=)". (3.26) 
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This result is called Stirling’s formula; its right side involves computa- 
tions less cumbersome than those required to calculate n! directly. For 
example, to approximate 40! to five significant digits without a calcula- 
tor, we use 


Methods for computing square roots such as \/27(40) by hand are as 
ancient as the invention of the wheel (not metaphorically!) and we can 
find the 40th power of 40/e using logarithms: 


40 
N (2) 
e 


InN = 40ln (2) 


e 
In N = 40(In 40 — Ine) 


N = 5.135945 x 10*6 


The final approximation for 40! is 8.1592 x 104” and required four mul- 
tiplications, one division, one square root, and two uses of (in lieu of a 
calculator) a book of logarithms. These operations are not trivial, but they 
have the advantage of being possible in a way that a direct calculation of 
40! is not. 


3.8 Turning a series into a 
continued fraction 


From (3.4) we have 


| rn ri 

e€ Ste oy oh day es 
and when x = —1, 

ee ak 1 1 

e. 2! 3! «A! 
or 

1 1 1 1 1 

1 = + fore, (3.27) 
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The series on the right has the form 


1 1 1 1 
+ 


ay a a2 a1 a2a3 a1 a20304 


bert (3.28) 


that we will now convert to a continued fraction. 
The first two terms may be written 


1 1 _ag—l 


ay a1a2 aia2 


and the reciprocal of the right side may be written 


ay,ag - ai(ag —1) +a, -_ ay 
aj —-1 ag—1 @g—1- 


So we can express the first two terms of (3.28) as 


= (3.29) 


in continued fraction form. 


Now we extend to the first three terms of (3.28), writing 


1 1 " 1 = 1 ag-1 
ay a1a2 a1a2a3 = ay a1 4203 
1 1 


ay a2a3 : 
a1 
a3 — 1 


With the expression in parentheses taking the role of a2 on the left of 
(3.29), we may conclude that the identity 


1 1 1 1 


+ 
ay a1a2 a1 a2a3 ay 
ay | 


a 
ag—1+ 2 


a3 —1 


properly converts the first three terms of series (3.28) to a continued frac- 
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tion. This method extends to an inductive proof of the general result 


1 1 1 1 
ay a 1a2 414243 41420304 
1 
a1 
ay } 
a2 
ag 1 | 
a3 
a3 1+ 
a4 = 1 + aiid: fo 


that we now apply specifically to (3.27). Setting a, = k yields 


1 a (3.30) 


2+ 


B44... 


and, to create a continued fraction for e alone, we rewrite the left side 


1 1 1 
1--=-- — 
€ 1 dl-e 
and compare to (3.29) to obtain 
1 A 1 
i oo oa 1 
1 
TSA 
Now (3.30) reads 
1 1 
pe 1 
1+ 1+ 
e—1 2 
1+ 
3 
2+ 
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from which!47! 


5+ 


(ele 
follows. This continued fraction takes its place beside (1.16) and (2.60), 
continued fractions for y and 7 respectively, in the museum of the most 
curious and graceful expressions in mathematics. 


3.9 Further content 


3.1 The probability of derangement. In 1708, Pierre Renard de Mont- 
mort (France, born 1678) posed a question about a card game named 
Treize (thirteen in French). In a simplified version of the game, a 
player shuffles a deck of thirteen cards that are numbered 1 through 
13. The player flips a card face up from the deck and says “one”, 
then flips the next card and says “two”, and so on. If at any time the 
spoken number matches the number on the card, the player wins. 


On the other hand, the player loses if no card sits in its numerical 
position in the shuffled deck. We call such an event a derangement 
of the cards. For example, the shuffled deck 


10 7 13 9 11 8 5 2 4 3 5 1 12 


leads to a loss because it is a derangement. Montmort’s question was 
the natural one: what is the probability of winning? 


We win with each of the shuffles 
9 138 5 12 4 8 1 2 10 7 11 3 6 
and 


7 9 3 11 5 12 2 8 6 13 10 4 1 


because each underlined card sits in its numerical position. These 
examples illustrate the difficulty of Montmort’s question, in that we 
win when one card sits in its numerical position, but also when two 
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cards do, or three cards, and so on. Using a fact from the study of 
combinatorics called the principle of inclusion-exclusion, we can an- 
alyze this complex situation. 


Here, however, we answer Montmort’s question indirectly by cal- 
culating the probability that we lose the game. Now we need only 
count the derangements. 


(a) Let D,, denote the number of derangements of a deck of n cards 


(b 


(c 


(d 


wm 


wm 


wm 


numbered 1 through n. By writing all possible shufflings, check 
that D, 0, Do 1, Dg 2, and D4 = 9. 


Next we will argue that when the deck has n > 2 cards, then 
Dy = (n—1)(Dn-1 + Dn_2). (3.31) 


But before we do, verify that (3.31) yields D3 = 2, D4 = 9, 
Ds = 44, and Dg = 265. (And if you enjoy it, continue until 
you reach D,3 = 2290792932, the (huge) number of derange- 
ments of 13 cards.) 


To prove (3.31), consider just the first flip in the game. We have a 
derangement only if we flip one of the n — 1 cards other than the 
1 card. This explains the first factor on the right side of (3.31). 


Say that the first card shows the number Kr. One of two possi- 
bilities exist: either the th card flipped shows the number 1, or 
it does not. The number of derangements of the rest of the deck 
is D,,—1 in one case, and D,,_2 in the other. Explain which is 
which, thus establishing the validity of (3.31). 


For (3.31) to be useful when n = 2, we need to define Do, the 
number of derangements of a deck with no cards. Explain why 
(3.31) forces Dp = 1. What common sense can you make of this 
answer? 


(e) Now rearrange (3.31) to obtain 


DS nD =D lh = Da oh (3.32) 


(f) The expression in brackets on the right side of (3.32) equals the 


expression on the left but with every n replaced by n — 1. With 
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this in mind, check the next three cases after these two: 


Depp py 0 

Dy 3p (Po) Set, 

D4 — 4D3 = —[D3 — 3D2| = —[-1] = 1, 
etc. 


Argue that, in general, 
Dy —nDy-1 = (-1)”. (3.33) 


(g) Divide both sides of (3.33) by n! and rearrange to obtain 


Dy, Dn —1)" 
a i CY) 
n! (n —1)! n! 


Explain why this equation implies that 


Da _ > (1k 
= x a= (3.34) 
k=0 
(h) Because n! counts the number of shufflings of a deck of n cards, 


then (3.34) with n = 18 calculates the probability of losing a 
game of Treize. Now compare the right side of (3.34) with (3.4) 


(evaluated at x = —1) to explain why 
. Dn 1 

lim —=-. 
noo n! € 


This result answers a stronger version of Montmort’s question, 
namely, as the number of cards in the deck increases, the prob- 
ability that the player wins approaches 1 — 1/e, or about 63%. 
Are you surprised to learn that about twice as many games of 
Treize are won than lost? 


3.2 Deranged matchings. This exercise appeals to the principle of 
inclusion-exclusion, and so might require some preliminary study. 


On a school trip, kindergarten students are paired at random for 
breakfast seat assignments, and then again at lunch. In this exercise, 
we will find the probability that none of the breakfast pairings are 
repeated at lunch. 


92 Se 
a & a C 
J b _-* ° ‘ , : 
A é ¢ a» 
oa. 9 fe é 9 
e a® 
en 
Oh Fis 
Figure 3.3 


Using the language of graph theory, we can think of each child as a 
vertex (represented by dots labeled a, b,c, ..., h in Figure 3.3) anda 
pairing as an edge (the lines in the figure). There are two graphs 
in Figure 3.3, one for breakfast (the solid edges on the left) and 
one for lunch (the dashed edges on the right). This pair of graphs 
depicts a deranged matching because none of the breakfast pairs 
(ad, bc, eh, fg) are reunited at lunch. 


(a) Suppose that there are 2n children (we need an even number 


(b 


wm 


for the pairing to work). Prove that the total number of possible 
matchings at breakfast is 


(2n — 1) x (Qn—3)x-+-x5x3x 1. 
Because this expression resembles a factorial, we use the nota- 


tion (2n — 1)!! to abbreviate it. For example, 


T!=7x6x5x4x3x2x1=5040_ but 


T=7x5x3x1 = 105. 


Because (—1)!! would equal the number of matchings when n = 
0 (that is, when there are no children), we define (—1)!! = 1 to 
count the empty matching. 

At breakfast, we choose pairs of children at random to be part- 
ners in one of these (2n — 1)!! ways. Let’s use the list 


Pi Pe Fe oka. Peg Ps 


to label the n pairs. At lunch, we match the children at random 
again. Define the event 


Ax = the breakfast pair P; is reunited at lunch 
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(c) 
(d) 


(e 


wm 


so that, for example, the union A; U Ag signals that pairs P, and 
P, ate both breakfast and lunch together. (This specific union 
does not preclude pairs other than P, and P2 from eating both 
meals together as well.) Lastly, let 


N¢ = |intersection of any @ of the sets Ax]. 


For example, to calculate N3 we must intersect any three events 
(A, M Ag M Aa, for example) and tally all possible pairings in 
which the corresponding breakfast pairs (P,, P2, and P, in this 
case) are also lunch partners. 

Explain why, for any @, the choice of which ¢ events to intersect 
is irrelevant. 

Demonstrate that Ng = (2n — 2¢—1)!!. 

Our strategy invokes the principle of inclusion-exclusion as a 
way to count the number of possible lunch matchings that repeat 
at least one pair from breakfast. Explain why the left side of 


U A, Ss se (—1)*+ (,) (3.35) 
k=1 


counts this, and then use the principle to justify the right side. 


To find the number of lunch matchings that do not contain any 
of the n breakfast pairs (that is, the number d, of deranged 
matchings), we subtract (3.35) from the total number of possible 
matchings. Provide the missing details that bring us to 


di (2) (2n-2k—1)!. (3.36) 
k=0 


The probability of a lunch matching being a deranged matching 
equals the left side of (3.36) divided by 
(2n — 1)!!, so we wish to divide the right side by the same ex- 
pression. We introduce the double falling factorial 


((2n — 1)), = (2n — 1) x (Qn —3) x -+- x (Qn— 2k +1) 


to streamline the notation. The subscript & on the left indicates 
the number of factors on the right. For example, 


((9))3 =9x7x5=315. 
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3.3 


With this in mind, explain how we may derive 


n 


ae =o > ((2n — 1))x (3.37) 


k=0 


from (3.36). 

(g) Use (3.37) to find the probability of a deranged matching if there 
are eight children. 

(h) As in exercise 1, we reveal e when we let n become arbitrarily 
large. If we focus on a particular term of the sum in (3.37), so 
that k is fixed, we find that 

k 
aim |-aF () (@n— vy] = SEF 
Provide the necessary details. Hint: before taking the limit, di- 
vide every factor (that contains n) by n. 
(i) Finally, use (3.4) to show that 


i dy 1 

im ———— = —. 

n>00 (2n—1)!! Ve 

As the number of children increases, then, the probability that 
none of the breakfast pairs are reunited at lunch is close to 61%. 


Euler’s constant . Here we detail the origins of a constant whose 
tale would easily expand to a full chapter of this book. 


Figure 3.4 provides a visual inspiration for our upcoming work. To 
approximate the area under y = 1/ax between x = 1 andz = n 
(where n is a whole number), we can draw the rectangles shown in 
the figure. Their total area exceeds the area beneath the curve by the 
areas of the shaded regions. Imagine sliding each shaded region, ex- 
cept the one above the interval (1,2), directly to its left until it is 
settled in that interval. None of the shaded regions would overlap, 
and together they would be contained in a unit square. Thus it ap- 
pears intuitively clear that the combined area of the shaded regions 
converges to a value between 0 and 1 even as n approaches infinity. 
Euler’s argument, which follows, allows us to draw this conclusion. 


In what follows, let 
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Figure 3.4 


Note that lim,_,.. Hp,1 gives the harmonic series (seen previously 
in exercise 4 where it appeared in a continued fraction for 7) and 
that lim,,_,.0 Hn, is the sum in the Basel problem (section 2.9). 


(a) From elementary calculus, we know that the Maclaurin series 


Oe ake. std 
In(l+n)=n res (3.38) 


converges for —1 < n < 1 and that 


n+1 1 
| —dx=In(1+n). 
i av 
Now observe that 


2 3 =4 n+l 
In(Lbn) In (Fx 5 x Fx ox ) 


n 


=In(1+1)+In(14+1/2)+---+In(14+1/n). 


Use (3.38) to expand each term in this last sum, and then obtain 


1 1 
In(1+n) = Ha 5Hnat-t—Hnn 3.39) 


by judicious factoring. (The sign on the final term depends on 
the parity of n.) 
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(b) We may rearrange (3.39) to 
1 1 1 
Ay 1 —ln (1 + n) = 5ttn,2 — > An.3 ati? ee iy: (3.40) 
D402 2 is no? 
Explain why the left side equals the desired area as n becomes 


infinite. 

(c) Now turn to the right side of (3.40). As n approaches infinity, 
why does the alternating sum converge? 
Historical note. We use 7 to denote the constant to which the 
right side of (3.40) converges as n approaches infinity. Euler’s 
interest in y ~ 0.577218 lasted for decades, during which he 
discovered other methods for calculating its digits. His primary 
motivation, however, was to link y to other notable numbers 
such as e and 7, something that 7 has been somewhat resistant 
to in the years since. This discussion finds its home in this chap- 
ter because e” has made frequent appearances in the work of 
post-Eulerian mathematicians. 


3.4 Euler’s constant +y via e and =. Here is the outline of a proof that 


Euler’s constant yy (introduced in exercise 3) as defined by 


: 1 1 1 
y= jim [+545 4-042) - mins) (3.41) 


exists. (This definition comes from the left side of (3.40).) The con- 
stants e and 7 each have a role in this argument. 


(a) We start with (3.14), which is 


1 n 1 n+1 
(1++) <e< (1++) (3.42) 
n n 


for any natural number n, and draw from it two conclusions. 
First, by taking the natural logarithm of each side of the inequal- 
ity on the left, show that 


1 1 
In (1 + ) ou (3.43) 
n n 


(b) Again, from the left inequality of (3.42) we have 


1 n 
e> (142) ; 
n 
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(c 


wm 


(d) 


(e) 


so define a,, as the multiplier that will raise the right side to 
equal the left side: 


€=An (1 + =) : (3.44) 
n 


From (3.44) and the inequality on the right of (3.42), derive the 
inequality 
1 
Qn <14+-. (3.45) 
n 
If we take the natural logarithm of each side of (3.45) and use 
(3.43), we reach 
1 
In (an) << —. 
n 
Establish that 
1 
= aig (apa/®) 4+In(n+l1)—In(n) (3.46) 
n 


by first taking the natural logarithm of each side of (3.44) and 
then simplifying appropriately. 

We can create the following m equations from (3.46) by letting 
n equal 1,2,...,m: 


1=I1n (a?) + In (2) — In(1) 


=n (a."/”) +In(3) —In(2) 


Wl Nle 


=In (as'/*) + In (4) — In (3) 


1 _ 1/m 
— =In (am ) + In (m +1) = In(m) 


Add these equations and simplify to reach 


| “1 
14 stat In(m +1) =} 7 7 lnax 
k=1 
Finally, with the help of (3.45) and (2.35), conclude that 
1 1 1 Ww 
14 a ] 1 —. AT 
5 + 3 7 n(m+l1)< 6 (3.47) 
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(f) Thus, the left side of (3.47) is bounded above, so the limit in 
(3.41) exists as long as the sequence {7,, } defined by 


1 1 1 
ma (1egt gee tt) mney) 


2 3 
is monotone increasing. To show this, show that 7, — Yn—1 > 0 
with the help of (3.43). 


3.5 The probability integral i ee e-’ dz = /7. (This exercise re- 
quires knowledge of multivariate calculus and polar coordinates.) 
Certain probabilities may be estimated using the normal distribu- 
tion, which involve a particular family of integrals. The integral 


i eda (3.48) 
is related to this family, and is of interest to us because, like Stirling’s 
formula (3.26), this special probability integral links e to 7. It cannot 


be computed directly by elementary means, but a foray into three 
dimensions sidesteps this difficulty. 


(a) Let J denote the integral (3.48) and observe that 


2 ad ee: oe 
[= e ” dx e ” dz. 


As a cosmetic (but crucial) change, we replace x with another 
variable (we will use y) in the second integral to obtain 


i =| e® dx / eY dy. 
‘co [. i eo tH") dy dx. 


(b) Convert this integral to polar coordinates to show that 


Bice 27 oe) 2 
r= re dr dé. 
0 0 


(c) Use the substitution u = r? to evaluate it and conclude that 


Explain why 


I? = a, from which the desired conclusion follows by taking 
square roots. 


CHAPTER 4 


@&. 


The area A of a circle with radius r can be calculated using the formula 
A=nr’, 


so the area of a unit circle (where r = 1) is 7. How much of this claim 
makes sense? If by “sense” we intend to involve our physical senses, then 
perhaps nothing about the claim makes sense. To settle the truth about the 
claim in a way that we can see and touch, someone would first need to 
produce a circle. But no one can. 

Many natural and human-made objects are circular, to be sure, but 
circles exist only in the abstract. Yet they could hardly be simpler to 
define: On a flat surface, choose any point. Now choose a distance. A 
circle is determined when we locate all points on the surface that are 
equidistant from the first point. 

The very symmetry and simplicity of this definition defeats any at- 
tempt to physically create a circle. If only the definition allowed some 
wiggle room, as in “all points on the surface that, within a small error, 
are equidistant from the first point,” then perhaps we could create a cir- 
cle that we could see and touch. Then we would need to prepare a flat 
surface (none exist) and choose a point on it (whatever that means). 

Perhaps “within a small error” can help us resolve the notions of flat 
and point in the same way that we used this phrase to relax the meaning 
of equidistant. Can we honestly claim to be defining a circle any more? 
This squirming would certainly cost us the elegant link between circles 
and the constant 7. 
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Despite the lack of actual circles, many people find it fascinating to 
explore a claim like “the area of a unit circle is 7” as a purely imagi- 
native task. Perhaps our willingness to imagine was sparked when we 
were young, while being taught to count with piles of buttons or pieces 
of chocolate, and our minds suddenly grasped that there is no largest 
number. So now, confronted with the number 


56 873 403 284 337 447 984 278 897 484, 


which quite likely has never been written or uttered by a human before 
this moment, you are perfectly willing to accept it as meaningful, and 
even feel confident that it could be the answer to some question. At noon 
last Tuesday, for example, perhaps a butterfly brought the combined total 
of wings flapped by creatures in the universe to this precise total. 


4.1 Proportions 


When you grew older, and there were three cookies to share with your 
one friend, you learned proportions. When ancient thinkers discussed 
proportion, they operated in the realm of geometry, interpreting propor- 
tion as, say, the ratio of the lengths of two line segments. For example, 
the concept of one half conjured in their minds a geometric analogue 
such as a bisected line segment or a square cut by its diagonal. 

In fact, these thinkers discovered how from a unit segment and two 
given segments of lengths a and 6 (as in Figure 4.1) they could construct 
a segment of length a/b. Here is one such construction: arrange the seg- 
ments of lengths a and b perpendicular to each other, in the manner of 
AB and BC in the figure. Now locate point D on BC so that DC = 1. 


C 
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Figure 4.1 
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Draw DE perpendicular to BC. Because AABC' ~ AE DC, we have 
a/b = DE/1, so DE has the desired length. 


So starting with a unit segment and using simple tools and logic, we 
can construct a segment that has, it would seem, any length that we wish. 
For example, we can produce a segment of length 3/7 by laying 3 copies 
of the unit segment end to end (and assigning this segment of length 3 
to the role of a in Figure 4.1), doing much the same with 7 copies to 
produce b, and then following the method described above to create a 
segment of length 3/7. (If a > b, the process requires a tweak.) By this 
method, we can claim to have created all positive proportions. 

Many ancient scholars believed that such proportions constituted all 
possible quantities, so they suspected that their method could create all 
numbers that exist. While they were rightly proud of having mastered 
proportion, they had only explored a tiny corner of what numbers have 
to offer. 

Among these thinkers was one who produced a line segment in a 
perfectly ordinary way and then demonstrated that its length was not the 
ratio of two whole numbers. In other words, his line segment could easily 
be drawn, yet its length represented an entirely new kind of number. To 
create it, make square ABC'D from a unit segment. Bisect the square 
with a diagonal AC. By the Pythagorean theorem, AC? = 17 + 1? = 2. 
The irrationality of AC’ = \/2 can be demonstrated in the same way that 
we proved \/5 irrational in exercise 6. This segment does not follow the 
old rules; it opens a path to unexplored territory. 

Such irrationals as J 2 may be constructed much like the rationals we 
built in Figure 4.1. Given segments of lengths a and b, lay a and b end to 
end (as in Figure 4.2) to create segment AC’. Locate midpoint M of AC 


D 
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Figure 4.2 


102 4% 


and draw the semicircle with center M and radius MA = ME = MC. 
Construct BD perpendicular to AC’. Because ZADC is inscribed in a 
semicircle, it must be a right angle. This implies that 7DAB+ZDCB = 
90°, but because Z7BDC+ZDCB = 90° as well, we see that ZDAB = 
ZBDC. Thus AABD ~ ADBC, so 


AB BD 


sae ert BDV 6. 
aD pe Vab 


To construct V6, for example, we can choose a = 2 and b = 3 and 
proceed as just described. 

If we learned our lesson earlier, we will hesitate before claiming that 
now we can construct all possible lengths and therefore all numbers that 
exist. A moment of thought might spring negative numbers to mind. If 
all numbers must be tied to geometric constructs, such as line segments, 
then negative numbers appear to be either nonsense or off-limits. For cen- 
turies, in fact, mathematicians in some cultures considered negative num- 
bers to carry no meaning whatsoever thanks to how tightly they linked 
numbers to geometry. 


4.2 Negatives 


Today, we have little difficulty understanding that if the temperature drops 
20° from its current 15°, then the new temperature is 15 — 20 or —5°. 
But there is a subtle change in the meaning of the symbol — as we pro- 
cess this statement; in 15 — 20, it indicates an interaction between the 
two numbers, whereas in —5°, the symbol points us in a direction. Our 
ability to mentally navigate this subtlety is quite modern. 

Yet this facility with negatives has ancient roots. As many as two 
thousand years ago, merchants in Asia employed counting rods, often 
colored black and red, to track payments and calculate bills. A customer 
who gave a merchant 10 coins, for example, but selected goods worth 14, 
would prompt the merchant to show the customer 10 black rods and 14 
red, removing them in black-red pairs until 4 reds remain. Neither person 
has written “10 — 14 = —4” but certainly the customer understands that 
she needs to fork over four more coins. 

Despite this common and natural interaction with negatives, the math- 
ematical community grappled with the idea of numbers that could be 
“less than nothing” for centuries before coming to a consensus about how 
they should be understood and used in calculations. Among the puzzling 
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questions they considered were: Must we reject wholesale the link be- 
tween numbers and geometry to accommodate negatives? What happens 
when negatives are multiplied or divided? Does a negative solution to an 
algebra problem carry any meaning? 

By way of illustrating the complexity of this long discussion, con- 
sider this argument proposed by John Wallis (who we met in Chapter 2). 
The values of the fractions 


1 1 1 1 1 
5 4 3 2 1 


increase at each step because the denominators are decreasing. The se- 
quence 


1/72 1/8 1/4 1/5 1/6 


exhibits the same behavior. As the denominators approach zero, the value 
of each successive fraction grows without bound, prompting Wallis to 
hypothesize that 
5 = 00 

might mean something. But if negatives are the natural continuation of 
decreasing positives as we pass through zero, then isn’t it plausible to 
say that dividing 1 by a negative number yields a quantity greater than 
infinity? Can we write 


7 eS 
=i] os 


and possibly be making sense? 

This musing by Wallis represents just one voice in the chorus of ideas 
prompted by negative numbers. Arguments were passionate; minds were 
changed; definitions were proposed, confronted, discarded and rebooted. 
And if this flow of ideas was turbulent, imagine how vibrant the discus- 
sion became when someone proposed the idea of subjecting a negative 
number to a square root. 


4.3 Chimeras 


A square root of a number JN is a number that, when squared, equals N. 
Because 3 x 3 = 9, for example, we say that 3 is a square root of 9. 
And because the product of negative numbers is positive, a square root 
of a negative number cannot exist. An expression such as \/—9, then, 
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is buried beneath two layers of obscurity; if —9 itself triggers cognitive 
chaos, because it is less than nothing, then surely its square root must be 
banished to the universe of pure imagination. 

As late as the 1800s, in fact, a number such as \/—9 was decried as 
“chimerical” (that is, monstrous, in reference to a creature from Greek 
myth that was built from parts of a snake, a goat, and a lion). A modern 
definition of chimerical good-naturedly flips its meaning by defining it as 
a desirable quality that is, sadly, impossible to attain. This transformation 
in the meaning of the word mirrors the shift in tone with which thinkers, 
over time, spoke of chimeras like \/—9. 

John Wallis thought deeply about the connections between geome- 
try, its younger sibling algebra, and about what each might explain about 
negative numbers and chimeras. Wallis suggested that we extend the 
metaphor that connects a number to the length of a line segment by car- 
ing as well about the direction of the segment. To illustrate, we revisit 
Figure 4.2 and treat point B as a sort of origin, like a zero, and stipulate 
that segments lying to the right of B have positive length (so BC has 
length b) while segments lying to the left of B have negative length (so 
BA has length —a). Now if we follow the same argument as presented 
with that figure, we conclude that 


BD = V—ab. 


This interpretation of the figure potentially offers us a snapshot of a 
chimera caught in the wild. 

While Wallis’s argument by no means settles any matters, his inspi- 
ration helped to loosen centuries-old biases against negatives and their 
square roots. There are, after all, scientific reasons to assign a direction 
to a number, whether we are describing physical forces such as gravity 
or trends such as rising temperatures. In fact, as we will soon see, our 
modern understanding of chimerical numbers relies heavily on the con- 
cept of direction. Yet it should be noted how carefully and patiently the 
conversation progressed: about 150 years after Wallis’s thought experi- 
ment, the label “chimerical” appeared in print. Clearly a consensus was 
difficult to attain. 


4.4 Cubics 


The cubic equation Ax* + Bx? +Cx+D = 0 played a crucial role in the 
acceptance of chimerical numbers as meaningful. In exercise 1, we will 


4.4 Cubics 105 


witness the geometric approach of Omar Khayyam (Persia, born 1048), 
whose methods drew inspiration from centuries of effort by other Persian 
mathematicians. Qin Jiushao (born c. 1200) spearheaded a millennium 
of Chinese attacks on cubics, following algorithmic lines in the tradition 
of his predecessors. 

Working with cubics is so difficult that Luca Pacioli (Italy, born c. 
1445) suggested that a full understanding was impossible. Nevertheless, 
several other Italian mathematicians made progress during the ensuing 
decades, despite the additional burden of the lack of a symbolic algebra. 
They described the expression x? = 6x + 10, for example, as “a cube 
equals six things and ten”. Bear this in mind during what follows, where 
we use the notation of algebra in a decidedly non-1500s way. 

Here we consider just two of the mathematicians involved with the 
story of cubics, Scipione del Ferro (Italy, born 1465) and Girolamo 
Cardano (Italy, born 1501). It is said that on his deathbed, del Ferro 
told one of his students his solution to cubics of the form taken by x? = 
6a + 10. Such cubics are called depressed thanks to the missing x? term. 

The trick that del Ferro discovered amounted to the change of vari- 
ables x = u + vu. Remember that he would not use these symbols, but 
would instead consider z as a length and thus x? as the volume of a cube, 
so our modern statement x = u + v would for del Ferro have signified 
the chopping of a cube into smaller bricks. 

Applying z = u+ v to x? = 6x + 10 yields 


(u+v)? = 6(u+v) +10, 
which can be tinkered with!*®! to reach 
u? +u? + (3uv — 6)(u+v) = 10. (4.1) 


Now we are welcome to place a stipulation on wu and v in any way that 
we choose because we set x = u+v and it is x for which we are solving. 
We could require, for example, that u be twice v, or that the product uv 
must be 10. Here, it is convenient to force 3uv — 6 = 0, so that (4.1) 
becomes 

w+v?=10 with 3uv—6=0. 


The second equation simplifies to v = 2/u, which substituted into the 
first equation gives us 


U 


9\3 
w+(2) =10, or u®—10u2+8=0. 
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Letting z = u® allows us to apply the quadratic formula to 
z>—10z+8=0 

to obtain the solutions 

u = (5+ V17)'° and u = (5 — V17)"". 

Combining the first of these solutions with u® + v? = 10 reveals!**! that 
v=(-V17)'*, 

which is the second solution for u above. Thus, because x = u+ v, 

e= (5+ V17)' + (6-V17)4 (4.2) 


solves the equation 7? = 6x + 10. 

This clever tactic was like a treasure to del Ferro, because in his day 
it was the custom that teaching positions could be lost to aspiring chal- 
lengers. Decades later, Niccolo Fontana (Italy, born c. 1500), also known 
as Tartaglia, rediscovered del Ferro’s manipulation, and he shared it with 
Cardano after much cajoling. The widely available details of this story 
could pass for overwrought fiction. 

Cardano’s book Ars Magna (The Great Art) contains not only del 
Ferro’s technique but also a method for “depressing” the equation 


Av? + Br? +Cr+D=0 


by eliminating the x? term. Divide by the leading coefficient A to obtain 


x? + be? +cx+d=0, (4.3) 


where b = B/A and so on. We briefly foray into calculus to motivate the 
forthcoming change of variables that will depress this cubic. Let f(x) 
denote the left side of (4.3), so f(a) = 6a” + 2b. Thus the curve f will 
have its inflection point on the y axis when b = 0. So all that we must do 
to drop the x? term from (4.3) is shift the curve left or right to place the 
curve’s inflection point on the y axis. This horizontal shift requires the 
substitution of « — a for x, so (4.3) becomes 


(c —a)? +.0(z — a)? +ce(x@ —a) +d=0. (4.4) 


Now (b — 3a)? is the x? term in the expansion of (4.4), so to drop this 
term, we require that a = b/3. We therefore substitute"! x — 6/3 for x. 
The shifted curve has different roots than the original (4.3), but reversing 
the shift poses no difficulty. 
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4.5 A truly curious thing 


Let’s revisit del Ferro’s technique, this time watching it work on the de- 
pressed cubic 
x? = 3px + 2q 


with p and q positive numbers (the 3 and 2 are choices that turn out con- 
venient in a bit). Now because p and q are positive, then the terms on the 
right would have had negative coefficients in their original positions on 
the left. Many mathematicians of Cardano’s day still disallowed negative 
numbers, so for them our moves would be nonsense. 

Let x = u+ vas before. Then 


(u+ v)3 = 3p(u + v) + 2¢ 
can be rewritten 
u® +u? + (3uv — 3p)(ut v) = 2q, 


whereupon we choose to set 3uv — 3p = 0, or v = p/u. Replacing v in 
u? + v3 = 2q, we obtain 


3 
w+ (*) = 2q. 
uw 


We multiply by u® and let z = u®. The result 
2? —I%qz+p? =0 


leads via the quadratic formula to the solutions 
1/3 
u= (at Ve? —p) ' 
As in the previous section, we conclude that 
1/3 1/3 
x= (a+ Ve? —p) + (a- Va? = 2) (4.5) 


is a solution to the original depressed cubic. We can check (4.5) against 
(4.2) if we note that (4.2) is a solution to 


z® = 62 +10 
and so p = 2andq = 5in 


x? = 3px + 2q. (4.6) 
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Now if we think about the depressed cubic (4.6) as the intersection 
of the curves x° and 3pz + 2q and do a little calculus, we discover a 
curious thing. Consider the situation in Figure 4.3, where both p and q 
are positive, and the straight line y = 3px+2q is drawn tangent to y = 2° 
at some point (a, a3). The y intercept of the straight line is (0, 2g) and its 
slope is 3p. We also know that its slope is 3a? from the derivative of «°°, 


sop =a’. 


Y= 3x4 29, 


| 


~~ 
— 


Figure 4.3 


If we adjust the slope of the straight line so that its slope 3p is greater 
than the slope of the tangent line PQ, then the straight line will intersect 
the curve in the first quadrant and twice more in the third quadrant. In 
other words, when 

3 
—2 
3p > pe ae : 
a 
then (4.6) will have three real solutions. Knowing that p = a? from 
above (and bearing in mind that a < 0 and p > 0), we may write!!! the 
inequality as 


g—p <0. (4.7) 
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Thus we have shown that «* = 3px+2q has three real solutions precisely 
when q? — p® in (4.5) is negative; in other words, the equation’s solutions 
are all non-chimerical only when the formula generates chimerical num- 
bers. 

Take x? = 6x + 4 as an example. Here p = 2 and q = 2, so (4.5) 
gives the solution 


p= 2a PH O=\ee”, (4.8) 


But because (4.7) holds for p = 2 and q = 2, we know that the curve 
x° intersects the line 6x + 4 three times. In fact, we can check that the 
intersections occur at x = —2andx =1+ V3 andz=1-—- VJ/3. 

Yet not only does (4.8) contain the chimera \/—4, but also it appears 
to give us just one solution, not three. To probe these mysteries, we must 
learn how expressions like 2 + \/—4 can be visualized. 


4.6 The complex plane 


A few decades after Cardano published his results, Rafael Bombelli 
(Italy, born c. 1526) coaxed them into revealing the real numbers hiding 
within. Provoked by the curious fact that (4.5) produces three real solu- 
tions precisely when its square roots contain negative numbers, Bombelli 
extended the rules that govern the behavior of real numbers to apply to 
chimerical numbers as well. 

To illustrate, we will simplify (4.8) to c = —2 as Bombelli would 
have done it (in fact, Bombelli used our x? = 6x + 4 as an example him- 
self). Let’s assume that the cube root of 2 + \/—4 has the same structure 


p+./—q. Then 
(p+ V—a)° =24+ V-4, (4.9) 
(p— V=g) =2- V-4. 
Multiply the corresponding sides together to obtain 
pe +q=2. (4.10) 
This maneuver requires trust in the rule 


J/—-9¢ x V—q = -4¢. (4.11) 
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It mimics the way that square roots act on positive numbers, but it need 
not necessarily be the way that square roots act on negatives. Bombelli 
gamely took (4.11) as a hypothesis and continued. 

We expand the left side of the top equation in (4.9) to see that 


(p? — 3pq) + (3p — g)/—q = 2+ V—4 


and, matching the non-chimerical terms left and right, we have 


p° — 3pq = 2. 
Together with (4.10), this yields 
2p* = 3p + 1, (4.12) 


a cubic equation with one accessible solution p = —1, which then gives 
us g = 1. Thus —1 + /—1 is acube root of 2 + /—4. Let’s check: 


(- 1+ V=1)° = (-1)8 + 3(-1)?V=T + 3(-1)(V=1)? + (V=T)? 
= -143V=14+34 (-)V=I (4.13) 
=2+2V-1 
=24+ 7-4. 


Similarly, we could calculate that (—1—./—1)? = 2—,/—4 and therefore 
that at (4.8) we have x = —2 as desired. All of this presupposes the good 
behavior of chimerical numbers, such as in the last step, where we used 
the rule 


Vm x /n = /mn 


as if it applies to negative m and n. But the fact that Bombelli’s approach 
appears to work certainly solidifies our confidence in these extensions of 
the rules of algebra. (See the endnotes!>”! for another approach.) 

Bombelli’s efforts, while admirable and evidently successful, leave 
us with another cubic to solve at (4.12). In this case, the solution p = —1 
can be guessed. But what if guessing fails? And are there other solutions 
to (4.12)? 

It took more than two centuries before an ingenious insight answered 
these questions almost effortlessly. This innovation, a way of visually 
capturing the behavior of chimeras like 2+ /—4, narrowly escaped John 
Wallis in the late 1600s. Recall that Wallis suggested that a chimerical 
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number might carry a geometric meaning if we allow numbers to some- 
how point in a direction. In 1799, Caspar Wessel (Norway, born 1745) 
argued that we might benefit from treating numbers like 2 + \/—4 as a 
pair of numbers, first by factoring \/—4 into 2\/—1 and then by lifting 
the coefficients of 2+ 2,/—1 (as if /—1 were a variable) into an ordered 
pair (2,2). Now this pair can be treated like the tip of a vector in the 
plane, forcing the vertical axis to represent a \/—1 number line while the 
horizontal axis continues to represent the usual number line. 

Not long after, Carl Gauss (Germany, born 1777) independently 
conceived of such an approach. Gauss used the symbol 7 to represent 
/—1 and the term complex for numbers of the form a + bi. The con- 
stant a is the real part of the complex number, and the coefficient b is the 
imaginary part. From here, we will follow these conventions. 

Although Wessel and Gauss took different approaches to visualizing 
complex numbers, their efforts were at heart quite similar to our modern 
approach. We associate the complex number a + bi with the point (a, b) 
and the vector (a, b) shown in Figure 4.4. We call this plane the complex 
plane. The magnitude of a complex number a + bi is denoted |a + bi| 
and refers to the length r of the vector (a,b), which is found by the 
Pythagorean theorem to be 


la + bi] = Va? + b?. 


The angle 6 created by the vector (a, b), as measured (in the usual way) 
counterclockwise from the positive horizontal axis, is denoted arg (a + bi) 
(for argument) in most venues, but here, in this less intense setting, we 


(a,b) 


ee ee 


Figure 4.4 


112 4% 


will use the usual angle symbol; for example, 
Z T 
Z(1+i)= 7 and Z(—2)=7. 
A complex number a + bi also has a polar form 
a+ bi = r(cos 0 + isin @) (4.14) 


as indicated'*3! by the figure. 

Earlier in this section, we followed Bombelli’s approach in turning 
(2 + 21)!/3 into —1 + i. His method was inspired but technical. We turn 
our attention now toward finding a way to raise each side of (4.14) to 
powers. This goal requires an understanding of how complex numbers 
multiply. In short, to multiply two complex numbers with the help of the 
complex plane, we multiply their magnitudes and add their angles; that 
is, to calculate 


p+qi = (a+ bi)(c+ di) 
geometrically, we can use 


(1) |p+qi| =|a+ bi|- |c+ dil, 
(2) Z(pt+qi) = Z(at+ bi) + Z(c+ dt). 


Anyone unfamiliar with how or why these observations are true can find 
help!“! in the endnotes. Armed with this technique for multiplying num- 
bers in the complex plane, we can recalculate Bombelli’s (2 + 2i)!/3 in 
a new way. Because the angle created by 2 + 2% is 7/4, we know that 
its cube root must create the angle 7/12. And because the magnitude of 
2+ 2i is V8 or (/2)3, we know that its cube root must have magnitude 
,/2. These observations lead to Figure 4.5, where we see!>*] that 


” 
Whe | eS 


p 


Figure 4.5 
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p = V2cos (m/12) = 


q = V2sin (1/12) = = s 


So 


1+v3_ Vv3-1 
2 7 2 


which we can check by cubing both sides. 


i= (2+21)18, 


But where is Bombelli’s cube root —1 + 7 from (4.13) earlier in this 
section? This solution hides in the fact that there are two other angles 
besides 7/12 that triple to 7/4, namely 37/4 and 177/12. (In fact, there 
are infinitely many other angles that triple to 7/4, but these are the three 
that lie between 0 and 27.) The first of these alternative angles leads us 
to calculate 


V2 (cos (37/4) + isin (37/4)) = V2 (- ay 


=-1+i, 


thus unearthing Bombelli’s solution. The third angle yields°! a third 
cube root 


V2 (cos (17m /12)+isin (177/12)) = se (v2 ~V34 inf ae v3) 


which we can check by cubing the right side to get 2 + 22. Once we see 
how the multiplication of complex numbers connects to the addition of 
the angles of their corresponding vectors, it becomes clear that there are 
n versions of the nth root of a complex number. This appealing symmetry 
helped make the use of complex numbers as routine as it is today. 


4.7 In(i) 


As we have seen, cubic equations revealed a surprising and undeniable 
link between complex numbers and real numbers, and innovations in vi- 
sualizing complex numbers simplified our manipulation of them. These 
investigations helped to bring an end to widespread suspicion of complex 
numbers during the centuries spanning the 1500s to the 1800s. 
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But even while suspicion faded, puzzles continued to vex the mathe- 
matical community. In the early 1700s, for example, Johann Bernoulli 
(Switzerland, born 1667) compared the integral 


1 
/ ee dx = arctan x (4.15) 


to the very similar 


1 1 1l+¢2 
Jom te = 5(7*2), 


We typically use trigonometric substitution to calculate the first integral, 
but partial fractions to find the second, as follows: because 


1 1 pos! ye 


1-22 (1+a)(1—z) 1+a 1-2’ 


we can say that 


1 1 1 a 1+ 
fe pt) 5 In(1 ©) = 53n(722). 


Now Bernoulli wondered what might happen if he applied the method of 
partial fractions to the first integral. Factoring 


1 1 


l+22 (14+ izr)(1—iz) 


requires complex coefficients, so we might have complex numerators 
when we proceed to the partial fractions decomposition 


1 A+Bi C+Di 
Sea 


(l+ix)(l—iz)  1l+iz l-ix © 


It follows®”! that A = C = 1/2 and B = D = 0, s0 


1 1 1 1 1 
/ dx = / — dx + / — dx 
1+ 22 2) 1l+ia 2/ l-iax 


1 1 
5 nl + iz) — 5 In(1 — ta) 


1 1+iax 
21 1—ix 
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Using (4.15) to replace the left side, we get 


ik (=<) 
arctang = —In ; 


21 1—ix 
a perplexing statement that links a function from trigonometry to the 
logarithm of a complex expression. When x = 1, the left side equals 


arctan (1) = “ 


while on the right we get 


which appears to be extending the domain of the logarithm function be- 
yond what we may be used to. Quirky results like this sparked a debate 
between Bernoulli and his mentor Gottfried Leibniz over the meaning of 
logarithms of negative or complex numbers. Both men wished to honor 
the rules of algebra and logarithms as they pertain to real numbers, but 
their conclusions differed nevertheless. For example, Bernoulli argued 
that because 


then In (a) must equal In (—2), as they share the same derivative. Leib- 
niz countered that one of Bernoulli’s two derivatives must be meaningless 
because one cannot find the logarithm of a negative number. Their dis- 
pute occupied them over a lengthy correspondence, with no agreement 
reached. It fell to Bernoulli’s own student Euler to sort this out. 


4.8 i0 = In(cos@ + isin@) 


Euler objected to Bernoulli’s conclusion that functions with equal deriva- 
tives must themselves be equal, using 
d 2 1 d 


qq; m2) = ar = — In (x) 


116 4% 


as a counterexample. But Euler also pushed back against limiting loga- 
rithmic differentiation to positive numbers, writing that Leibniz’s sugges- 
tion, “if it were right, would shake the foundation of all analysis, which 
consists principally in the generality of the rules and operations which 
are judged to be true, whatever the nature we suppose the quantities to 
be that we apply them to.” 

After reacting further to both Bernoulli and Leibniz, Euler detailed 
his own mental travails as he grappled with what meaning the logarithm 
of a negative number or the logarithm of a complex number could carry. 
In particular, he explained that his original hypothesis, that such a loga- 
rithm could only have one value, evolved to include the possibility that 
there may be more ways, even infinitely many, to evaluate such a loga- 
rithm. Ultimately, he struck upon the following argument and its almost 
preternatural conclusion. 

Start with « = cos@ and y = sin@ as labeled on the unit circle in 
Figure 4.6. By the Pythagorean theorem, we know that = \/1-— y?, 
and from y = sin 6 we find dy = cos @ dé as the differential. Thus 

dy _ dy dy 


cos6 ot 1-42 


Substitute iz for y (so dy = i dz and 1— y? = 1+4 2?) and then integrate 
to obtain 


fw / 1 dz 9 a dz 
= | —— or Se 
V1+ 2? V14+ 22 


f 
t 
} 
i 
eis 
4 / I . 
/ i Y= 61m @ 
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A: nee | Cee see 
K=cos 8 


Figure 4.6 
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This integral succumbs to the trigonometric substitution 


z=tana, so dz=sec?ada, 


g=i | seca da. 


From elementary calculus we know that we can write 


leading**! to 


6 =iln(seca+ tana) +C 


and so, because z” = tan? a = sec? a — 1, we get 


6=in(V1+24+2)4+C 
=iln(/1—-y2+y/i)+C 
=iln(/1—y? + iy/i?)+C 
=iln(/1—y? —iy) +C. 


As Figure 4.6 makes clear, when 6 = 0 we have y = 0, so C = 0 as 
well. We may tweak!**! the last expression to obtain 


6 = —iln(/1— y? + ty) (4.16) 


and multiplying both sides by 7 we get 


10 = In(cos@ + isin6). (4.17) 


Here, Euler noted that the periodic natures of the trigonometric functions 
cosine and sine allow for infinitely many solutions, so (with n as an inte- 
ger) he wrote (4.17) as 


i(@ + 27n) = In (cos6 +isin6) (4.18) 


to reflect this. 
Euler checked the plausibility of (4.18), with judicious choices of 6, 
against the existing properties of logarithms. Letting 6 = 0 we get 


i(27n) = In(1), 


and n = 0 yields the familiar In (1) = 0. With n ranging throughout the 
integers, we see that 


In(1)= 0, Qt, Ani, 671i, ete. (4.19) 


Setting 6 = 7, on the other hand, produces 
i(m + 27n) = In(—1), 


which as a list is 


In(—1) = TH, 37%, 571, Tri, ete. 


One property of logarithms is In (7) = mn (a), so Euler checked it 
with his results. Because (—1)? = 1, we can write 


In (1) = In ((-1)?) = 2In(—1) = £2mi, 46mi, £107, etc., (4.20) 


all solutions that appear in the list (4.19). (We do not generate the entire 
list because — 1 is just one of the two square roots of 1, so we could have 
written In (17) in the second spot of (4.20), which would have generated 
the missing entries.) 

Euler carefully checked other cases until he was satisfied that his ar- 
gument had produced a conclusion that did not contradict existing knowl- 
edge. With his results, he settled the dispute between his mentor Bernoulli 
and his mentor Leibniz. But what he discovered was much more, like a 
path leading to even more curious places. 
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A few decades before Euler set the argument in the previous section on 
paper, Roger Cotes (England, born 1682) drew conclusion (4.17) from 
his study of the lengths of certain curves. Exponentiating (4.17), we reach 
e’? — cos + isin 8, (4.21) 
and though Cotes did not take this final step, we commonly call this the 
Euler-Cotes formula. 
The formula’s beauty originates not only in the bond that it forges 
between e and 2 and thus between the real and the complex, but also in 
the intriguing results that it implies. Both of these points can be made at 


once by obtaining 
e™ =—-1 (4.22) 


from (4.21) when 6 = x. In this equation, a real irrational number raised 
to a complex power equals an integer, and three of this book’s four con- 
stants coalesce in a six symbol truth. In a moment, we will learn to 
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view this arcane equation as something accessible and plausible, but that 
knowledge need not stifle our admiration of the thinkers who alchemized 
this equation. 

If we choose to set 9 = 7/2 in (4.21), we can then manipulate!™! the 
result to reach either 

= ji = eo 7/?, (4.23) 

depending on the choices that we make as we simplify. The first ex- 
presses 7 as an unexpected ratio of complex quantities, and the second 
reveals that a complex number raised to a complex power can simplify 
to a real number. In this case, 


i’ = 0.2078795763507619.... 


These peculiar results are merely instances of the Euler-Cotes for- 

mula 
e’? = cos6 + isin 8, 
and a visual interpretation of it can help us assess its odd offspring. Re- 
call from section 4.6 that the right side is a complex number that can be 
identified with a unit vector in the complex plane that makes the angle 
6 with the positive horizontal axis. Thus, the left side e’” can itself be 
linked to this vector, as in Figure 4.7. So when @ = 7, the vector asso- 
ciated with e’” points to the location —1 on the horizontal axis. So now 
we can see that 
e7 =-1, 


et") : 


Figure 4.7 
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Using the same technique, we can set 0 = 7/2 and see that 


ei(n/2) = i, 


from which we generated the mysteries in (4.23). 


4.10 The shortest path 


Much of the final resistance to complex numbers faded as it became clear 
that their behavior posed no threat to the rules and operations of algebra. 
On the contrary, quite often the complex realm opened paths that made 
already existing results easier to prove. A useful tool for illustrating this 
effect is named after Abraham de Moivre (France, born 1667), and fol- 
lows easily from the Euler-Cotes formula 


e? = cosé + ising 


by replacing 6 with né. On the left, 
el(n) — (€)" = (cos@ + isin 8)”, 


so 
(cos 6 + isin 6)" = cos (n@) + isin (nd). (4.24) 


From this formula we can derive trigonometric identities using just a 
smidgen of algebra. For example, let n = 2 in (4.24) and expand the left 
side to get 


cos” 6 — sin? 6 + 2i cos @sin 6 = cos (20) + isin (26). 
Comparing the real parts, we see that 
cos (20) = cos” 6 — sin? 0, 
and doing the same with the coefficients of 2 yields 
sin (20) = 2cos@ sin 0. 


Both of these identities are common and useful, and this derivation of 
them could hardly be more straightforward. 
If we let » = 3 in (4.24) and carry out the same procedure, we can 
show that 
cos (30) = cos? 6 — 3cos @ sin” 6 
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and, using sin? 6 = 1 — cos? 6, we can simplify to 
cos (30) = 4cos? @ — 3cos 0. (4.25) 


Again the derivation is simple, but notice that in this case, with x = cos 0, 
the right side of (4.25) becomes 4° — 32, which mimics the structure 
of a depressed cubic equation. Exercise 2 explains how Francois Viéte 
(France, born 1540) used (4.25) to solve such equations. 

Further, de Moivre’s formula (4.24) simplifies calculations that in- 
volve powers of complex numbers. As an example, we will find a fifth 
root of the number 1 (specifically because we will be reminded of our 
study of the golden ratio y). As we saw in section 4.6, the multiplica- 
tion of complex numbers is linked to the addition of the angles that they 
make with the positive horizontal axis. This fact distributes the vectors 
that correspond to the fifth roots of 1 in the pentagonal fashion depicted 
in Figure 4.8. We will not be surprised, then, when vy appears in our work, 
knowing the pentagon as its birthplace. 

Focus on the vector pointing at P in Figure 4.8. Its angle equals 27/5 
(or 72°), so we choose @ = 27 and n = 1/5 in (4.24) to get 


1/5 


(cos (2m) + isin (2m)) '” = cos (27/5) + isin (27/5). 


The left simplifies to 1!/° as desired. For the right, recall that 
tan (27/5) = /4¢ +3 from section 1.3. Thus, 


tan (27/5) = oe 4p +3 and 2? +y?=1 
x 


ion 
Laps, & 
BY \ a S 
< i \ ‘ 


Figure 4.8 
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(this second fact by the Pythagorean theorem). Solving'®!! for x yields 


1 / 2 
x = cos (27/5) = 3” andso y= sin (27/5) = — ; 
~ 
(4.26) 
So we have our fifth root of 1: 
1 L/ 2 
(cp TE (4.27) 


2p 2 


Raising the complex right side to the power 5 gives the simple result 1. 
We could find the three remaining fifth roots at the other vertices of the 
pentagon by the same technique. Alternatively, because they are 


ee, eae and 1, 


we could instead take powers of (4.27). Either way, the symmetry in 
Figure 4.8 plays to our advantage. 


411 p=e'™ + e-i7/5 


What better punchline for this book than the formula in the heading of 

this section, which binds all four of our constants into one elegant ex- 

pression? We have just the results in our toolkit to reach it quickly. 
Replace @ with —@ in the Euler-Cotes formula 


e”? = cos6 + isind 


and use the negative angle trigonometric identities 
cos(—6) =cos@ and sin(—0) = —siné 


to reach 
e'? = cos6 — isind. 
Adding this to the original formula gives us 
2cosé =e +e~%. (4.28) 


From (4.26) we have cos (27/5) = 1/(2y), and with the help'®?! of the 
half-angle identity for cosine we can convert this to cos (7/5) = y/2. 
And so 

y= eit /5 4 e7it/5 


pulls everything together into a fitting last word. 
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4.12 Further content 


4.1 Khayyam’s geometric solution to a cubic. Many centuries before 
Cardano and Bombelli studied cubic equations, Omar Khayyam 
(Persia, born 1048) published a treatise on mathematics that included 
a thorough treatment of such equations. True to his time, his ap- 
proach was geometric, not heavily symbolic, and averse to negative 
numbers. For example, to describe the cubic 


x? +az?+bx =c, (4.29) 
Khayyam wrote, 
A cube plus squares plus edges equals a number, 


and he assumed that all necessary numbers were positive. 


Limiting his tools to compass, straightedge, and hyperbola, Khayyam 
showed how to start with segments of lengths a, b, c (the coefficients 
in the cubic equation (4.29)) and construct a segment with length 
equal to the solution to the equation. This exercise outlines his pro- 
cedure. 


We begin with a hyperbola. In Figure 4.9, focus on segments AB 
(think of it as a horizontal axis) and AC (think of it as a vertical axis) 
and the curve TE, which depicts a hyperbola with AB and AC as 
its asymptotes. Using modern notation, we could describe this curve 
with the equation y = a/x for some a > 0 and say that we are only 
showing a piece of the curve in the first quadrant. Khayyam had 
none of this modern apparatus. For him, the relationship between a 
hyperbola and its asymptotes kept a particular product fixed: at any 
point on the hyperbola, the product of the distances to each asymp- 
tote from that point equaled a constant. Thus, for example, 


GC-GJ = ED- EB. (4.30) 


As Khayy4ém would phrase this, the areas of rectangles 
GCAJ and EDAB are equal. 


Now let’s build Figure 4.9 from a blank page using our simple tools. 
Draw a segment AB with length c/b and then construct AD per- 
pendicular to AB so that AD has length Vb. (We learned how to 
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Figure 4.9 


construct ratios and square roots earlier in this section, using Fig- 
ures 4.1 and Figure 4.2.) Complete rectangle E DAB. Extend AD 
to AR. 


Situate a hyperbola so that AB and AR are its asymptotes and it 
passes through the point &. A hyperbola can be drawn using simple 
tools, but not with just compass and straightedge. This is why we 
listed the hyperbola that we just placed into the figure among our 
tools. 


Finally, extend &D through D to F' so that DF has length a. (Now 
all three coefficients in our cubic equation appear in the figure.) Con- 
struct a semicircle with FF as its diameter, and let G be the point 
of intersection of the semicircle and the hyperbola. Locate C on AR 
so that AC and C'F are perpendicular. Last, label points H and J as 
shown. 


Khayyam analyzed this figure along the following lines. 


(a) Starting with (4.30) and using the figure, show that 
GH.DH=AD-HE. (4.31) 
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4.2 


(b) When a perpendicular connects a point on a semicircle to the 
diameter, as is the case with GH and F'E, then the square of 
the perpendicular’s length equals the product of the lengths of 
the two pieces of the diameter. (Using Figure 4.2, we saw this 
earlier.) Thus, 


GH? = FH -HE. (4.32) 

From (4.31) and (4.32), derive 
AD?. HE = DH? . FH. (4.33) 
(c) Replace FH with DH + FD in (4.33), and then add DH - AD? 


to both sides. Simplify the result to obtain 
DH? + FD-DH*+AD?.DH=AD?-ED. (4.34) 
(d) Explain why (4.34) provides the answer that Khayyam sought. 
Historical note. Khayyam, bound by his rejection of negative num- 


bers, provided similar constructions for various versions of the cubic 
equation (4.29). 


Viete’s trigonometric approach to cubics. In section 4.10, we used 
the Euler-Cotes formula to generate the trigonometric identity 


cos (30) = 4cos? 6 — 3cos 8, 
which may be written 
cos® § = ; cos 0 + ; cos (36). (4.35) 
This formula’s resemblance to the depressed cubic equation 
x = 3px + 2q (4.36) 


(from section 4.5) prompted Viéte to discover a way to solve the 
depressed cubic equation with the help of (4.35). 


(a) The conspicuous choice 7 = cos@ fails because x represents 
a solution to the cubic equation, so can potentially take on any 
value, while cos @ is trapped between —1 and 1. To free cos @ 
from its bounds, we multiply it by an arbitrary constant a and let 
x = acos6@. Use this substitution to write (4.35) as 


a a? 
xg =3 (5) x + = cos (30). (4.37) 
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(b) Comparing (4.37) to (4.36), it seems prudent to let 
p= a? /4 , which is equivalent to a= 2,/p, (4.38) 


and 
3 


2q = = cos (36). 


Show how from these results we can reach 


1 q 
d= 3 arccos (+n): (4.39) 
(c) We initially let x = acos 0, so (4.38) and (4.39) together yield 
mes q 
x = 2,/p cos 3 arccos pe . (4.40) 


This is Viéte’s formula. In section 4.5, we found the solutions of 
x = 6x +4 to be x = —2andx = 1+ V3. Use (4.40) to find 
one of them. 


(d) The expression g/p?/? 


and 1 for the inverse cosine function to give a real number. Ex- 
plain the connection between this observation and (4.7) from 
section 4.5. 


in (4.40) must be contained between —1 


(e) If the expression g/p?/? 


and 1, then the inverse cosine function outputs an complex num- 
ber. Describe how (4.16) and Figure 4.6 may be used to calculate 


arccos (2) = iln(2 + V3) 


in (4.40) is not contained between —1 


as an example of this phenomenon. Note that the cosine of each 
side results in 


cos (iIn (2 + V3)) = 2, 


and because we evaluate trigonometric functions at angles, here 
we have found the cosine of a complex angle. 


4.3 A complex approach to the Basel problem. Section 2.9 gave one 
of Euler’s proofs that 
n 1 1 1 1 
— ieee 441 
6 pints tp os) 
(the solution to what is known as the Basel problem). Here we out- 
line a clever alternative that makes use of our knowledge of complex 


numbers. 
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(a) From elementary calculus, we know that the series 


x x x 
tt - Leia 4.42 
a a ea rae Sar pe ae 


converges for —1 < x < 1. May we safely replace x with a com- 
plex expression? The answer is yes, so long as certain conditions 
are met. The note at the end of this exercise elaborates. For now, 
we will trust that replacing x with e~?"* poses no threat. 

Make this substitution in (4.42) and then integrate both sides 
from x = 0 to x = 7/2 to show that 


m/2 ; 
| In (1+ e7 2") da 
0 


is equal to 
—] ae ’ eo 2't ate] rn e7 3tt = 4 
a \© 2 32 
(b) Explain how e’** = —1 (from (4.22)) helps us simplify this to 
: 1+ z + : oa : =F 
a BA Be 72 
As we saw in section 2.9, the sum in the parentheses is 3/4 of 
the sum on the right side of (4.41). 
(c) By factoring e~*” from both terms inside In (1 + e~?"”), and 


then using properties of logarithms along with (4.28), show that 
In (1+ 77") = —ia + In (2cos 2). 


(d) Use all results that we have thus far established to obtain 


is [mr 3/1 1 1 
| In (2 cos) de = i] = +(3 Poa = J]. 
(4.43) 


(e) The integral on the left of (4.43) is improper (at 7/2), and the an- 
tiderivative of In (2 cos x) is not elementary, but it can be shown 
(using theory that is beyond the scope of this book) that the in- 
tegral converges to a real number. The right side of (4.43), by 
contrast, is merely a real coefficient times 7 and is thus purely 
imaginary. Therefore, to be equal, each side must equal 0. Set 
the right side equal to 0 to establish (4.41). 
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Note. As hinted at in part (a), we may only replace x with e~?*” 


under the condition that |e~?"*| < 1. But 
|e~*"*| = | cos (2) + isin (22)| 


= (cos? (2x) + sin? (2x))\/ 
=1 


y] 


which then (it turns out) requires that we carefully take limits. 
The details do not lie within the purview of this book, although 
they lie right on the boundary. Investigations into matters like 
this occupied mathematicians long after complex numbers were 
established as meaningful and useful. 


4.4 Hamilton discovers the quaternions. The complex number 
a+ bi where i? = —1 


can be associated with the ordered pair (a, b) and visualized on the 
complex plane as a vector that extends to that point. Further, the mul- 
tiplication of two complex numbers in the complex plane involves 
the addition of angles, and therefore rotation. Like us, William 
Hamilton (England, born 1805) knew these facts, and he asked the 
natural questions: might there be trip/et numbers with two imaginary 
parts, of the form 


a+bi+cj where i? = —1 andj? = —1 buti 4 j 


as well? And might the product of two such numbers involve rotation 
in a three dimensional space? 


Hamilton wished to preserve the behaviors of complex numbers while 
he pushed into new territory, so he stipulated that 


The magnitude of the product of two triplets should equal 
the product of their individual magnitudes. 


Complex numbers obey this rule (see (13) in Extra Help if need be). 
(a) To find the magnitude of a triplet a+ bi + cj, we need a three di- 


mensional extension of the Pythagorean theorem. Sketch a vec- 
tor that extends to the point (a,b,c) and use the Pythagorean 
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theorem twice to show that 
lat+bit+cj| = (2 4+0? +7)”. 


Recall that the notation on the left signifies the magnitude (or 
length) of the triplet. 

(b) Hamilton checked if the multiplication rule held when a triplet 
was multiplied by itself. He computed 


(a+bi+cj)? = (a? —b? —c*) +i(2ab) + j(2ac) +77 (2be). (4.44) 


Verify his result. Note that to reach his conclusion, 
Hamilton assumed that 77 = jz, a reasonable assumption given 
that multiplication is commutative for real numbers. 

(c) Now show that when we find the magnitude of each side of 
(4.44), the right side’s magnitude exceeds the left side’s by the 
term (2bc)?. 

(d) This mismatch of the two magnitudes could be corrected if we 
set 27 = O, but this would contradict the multiplication rule. 
Explain both the correction and the contradiction. 

(e) Hamilton realized that the mismatch could also be repaired if 
he abandoned his assumption that 77 = j2, replacing it with 
ij = —jt. How does this do the trick? 

(f) Updating his assumptions, Hamilton asserted that 


#=-1 and j?=-1 but ij =—ji. 


This last hypothesis fixes (4.44), as we have just seen. So Hamil- 
ton tried multiplying two different triplets a + b¢ + cj and # + 
yi + zj using his new assumptions. Check that the product (a + 
bi + cj)(a + yi + 27) equals 


(ax — by — cz) + i(ay + bx) + j(az + cx) +17 (bz — cy) 


and then explain why the magnitudes differ by a factor of 4bcyz. 


Historical note. After further struggle to repair this new error 
(his son routinely asked him at breakfast if Papa had solved his 
puzzle yet), Hamilton was struck by inspiration while on a walk 
with his wife. He realized that he could allow a third imaginary 
number k to join i and 7, forming the quaternion a + bi + cj + 
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dk, and let the troublesome product 77 equal &. He once again 
tweaked his assumptions to read 


P=-1 jf =-1 kh? =-1 
Gah: Gat MSF 


and then carefully checked the magnitudes of multiplied quater- 
nions in the way that we have done throughout this exercise. This 
time, everything fell into place. Hamilton had discovered a new 
set of numbers that behaved in many ways like complex num- 
bers, but with a remarkable new structure that had never before 
been thoroughly studied. 

Hamilton’s results conclude this chapter on 2 for three reasons. 
His story reminds us that in mathematics, the door just opened is 
rarely the last; almost every discovery results from persistence 
and risk taking; and one need not be a lifelong expert to be a 
pioneer. These takeaways underlie everything contained in this 
book. 


APPENDIX A 


Wallis’s original derivation of 
his formula for z 


At (2.11), we saw John Wallis’s formula 


mn 2 2 4 4 6 6 
=-X=xX=xK xX ixX sXe 
2 1 3 3 5 5 7 


but did not pursue his original derivation due to the extensive work re- 
quired. His clever argument, however, belongs in this book, as it testifies 
to the ingenuity of mathematicians who lacked the tools of calculus and 
were forced to make do with basic algebra and geometry. Here is how 
Wallis proceeded. 

On the left in Figure A.1, we see a quarter circle inside a unit square. 
The quarter circle can be described by the equation y = (1 — x?)!/?, and 
we will use this description despite the fact that Wallis did not have this 
convenient notation available to him. (Wallis did draw figures like those 
in Figure A.1, but described the proportions of each curve in words.) The 
area of the quarter circle is 7/4 and, wishing to derive a formula for 7, 
Wallis guessed that the study of curves that fit the pattern y = (1—a™)” 
would prove fruitful. 

Let’s begin to follow Wallis’s path by considering the curve y = x°, 
inscribed in a unit square on the right in Figure A.1. What proportion of 
the square is shaded (so, below the curve)? Wallis argued that we could 
find this proportion by this strategy: 


1. Slice the square vertically (like from A to C) at equal 
intervals (as indicated by the marks along the bottom edge), 
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[re ee A ae i 


a A 
Figure A.1 


2. sum the lengths of each part of each slice that land below 
the curve (like AB), 


3. put this sum in ratio to the sum of the entire length (AC) 
of each slice, and 


4. let the number of slices become infinite. 


This slicing method has ancient roots. Democritus (Greece, born c. 460 
BCE), for example, suggested that if cross sectional areas of two vol- 
umes were equal throughout their equal heights, then the volumes must 
be equal. 

If there are n equidistant vertical slices, then their distances from the 
lower left corner L (see Figure A.1 again) can be marked 


O/n, 1/n, 2/n, ..., n/n. 
Thus, the ratio sought in step 3 above is 


(0/n)* + (A/n)? + (2/n)? +--+ + (n/n)? 
1+1+1+---+1 


or, multiplying every term by n°, 


074+ 194+ 23 +---4+n3 
m+tn+n3+---+n3 : 


Wallis experimented with this last expression in hopes of pinning down 
its behavior as n becomes infinitely large (part 4 of his strategy). The first 
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few cases are 


OPTS? 59) Te. 
84+8+8 24 4° 8 
0+14+8+427 36 1 #1 


97427427427 108 4° 12° 


Wallis concluded that ’If an infinite series of quantities which are the 

cubes of a series of continuously increasing numbers in arithmetic pro- 

gression, beginning, say, with 0, is divided by the sum of numbers all 

equal to the highest and equal in number, then we obtain 1/4.” So the 

shaded region on the right in Figure A.1 occupies 1/4 of the unit square. 
In a similar way, Wallis concluded that 


The region below the curve y = x” (where n is any positive 
rational number) occupies 1/(n + 1) part of the unit square (W1) 


that encloses it. 
1 
1 
| av’ dz = ; 
0 n a 1 


Now recall that Wallis was specifically interested in the area defined 


In modern notation, 


by curves of the form y = (1 — x)” so that he can investigate the 
quarter circle y = (1 — x?)!/?. Take y = (1 — x)? as an example, and 
remember that all areas are being found inside a unit square. Expanding 
and applying (W1), we have 


2 


area under (1 — x)? = area under (1 — 2a + x”) 


=., (A.1) 


Wallis created Table A.1 to store results like this one. Each cell holds the 
number WN so that the region under the curve y = (1 — gil P)* occupies 
1/N of the unit square. (Wallis put reciprocals in his table for clarity. 
Every fractional area has the numerator 1, so by using reciprocals, Wallis 
was simply ignoring their clutter.) So 3 appears in the cell for p = 1 and 
k = 2 thanks to (A.1). The cell containing UD marks the key quantity that 
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Table A.1: The reciprocal of each entry is the portion of a unit square (as in 
Figure A.1) beneath y = (1 — 21/?)*. 


k 
0 ”* lw 2 *% 3 
O;1 1 
Y, 
ee | 2 3 4 
~, 
2/1 3 6 10 
~ 
3 | 1 4 10 20 


Wallis was pursuing. The extra help!®! has further examples of how the 
entries in Table A.1 are calculated. 

We might also have filled in the rows for fractional p and whole num- 
bered & using an approach like in calculation (A.1), but delayed in order 
to highlight the sequences that appear in rows p = 2 and p = 3. The 
sequence 1,3,6,10,... consists of the triangular numbers and the se- 
quence 1,4,10,20,... lists the tetrahedral numbers. To fill in the en- 
tries between those in these sequences, in the columns that correspond to 
k, = 1/2 and k = 3/2 and so on, Wallis would have needed the ability 
to expand expressions with fractional exponents like (1 — x!/3)!/? and 
(1 — a'/?)3/?. He lacked this knowledge, but it would not be long until 
Isaac Newton discovered a way. (Section 2.7 elaborates.) 

Wallis instead appealed to a process called interpolation, which in 
this case amounts to making smart guesses as to what numbers come 
between those in a given sequence. The sequence 1, 2,3,4,... in row 
p = 1, for example, interpolates as 


1 3/2 2 5/2 3 Tho 4 


by common sense. But what numbers should occupy the blanks in the 
sequence 


1 3 6 10 


from row p = 2 in order to properly interpolate the triangular numbers? 
Wallis appealed to geometry. The modifier triangular” is prompted by 
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the visual representation in Figure A.2. This chart lists the number of 
dots in each figure and the number of dots along the base: 


# dots total: 1 3 6 10 


# dots along base: 1 2 3 4 
Wallis guessed that the second row should be interpolated in the obvious 
way, as in 
# dots total: 1 3 6 10 


#dots along base: 1 3/2 2 5/2 3 7/2 4 
No matter that a triangle like those in Figure A.2 can’t have a fractional 
number of dots along its base. Audacious leaps are hallmarks of note- 
worthy mathematicians like John Wallis. 

To interpolate the top row, Wallis determined the ratio of each entry 
in the top row to its counterpart in the bottom row. The nth triangular 
number has 1 + 2+ 3-+----+ 7 dots, and the well known formula 


1+2434---+n=5(n+1) 


enabled Wallis to find that 


# dots total S(n+1 1 
ots tota _ ain ) = Fin+)). 


# dots along base n 
Substituting 3/2 for the number of dots on a side yields 


# dots total 1/3 

3/2 = 2D 
Wallis therefore concluded that there are 15/8 dots in a triangle that has 
3/2 dots along its base. (Again, this imaginative result is not meant to be 


drawn.) The same process'™! applied to 5/2 and 7/2 fills in the top row: 


15 
+ 1) andso_ # dots total = e 
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# dots total: 1 15/8 3 35/8 6 63/8 10 
#dotsalong base: 1 3/2 2 5/2 3 7/2 4 


These results fill in row p = 2 of Table A.2. Note the symmetry that 
row p = 2 shares with column k = 2, which contains entries that can be 
calculated directly with (W1) without interpolation. Row p = 1 of Table 
A.2 has been filled in using interpolation and column k = 1 by direct 
calculation. Wallis interpolated the tetrahedral numbers 1, 4,10, 20,... 
by the same technique that he applied to the triangular numbers (but the 
table only includes the results for p = 3,k = 1/2 and p = 1/2,k = 3). 

Wallis was now ready to move on LU. Focus on row p = 1/2 of Table 
A.2, which has 


1 3/2 15/8 105/48 


as its numerical entries. Lacking a geometric metaphor (such as he found 
when he interpolated the triangular numbers) for these fractions, Wallis 
sought a purely arithmetic pattern within their structures. He found it: the 
sequence is generated by the partial products of 


3 


1x =x ears 
2 6 


| Or 


The numerators being all odds and the denominators all evens, Wallis 
reasoned that he could calculate the missing entries in row p = 1/2 


Table A.2: The reciprocal of each entry is the portion of a unit square beneath 
y = (1—'/”)*, (Here, a few more entries of Table A.1 have been filled in.) 


k 
0 %*&* 1k 2 3 

0} 1 1 1 1 

Y%} 1 Ys 'Y6 10S 4g 
p lil *% 2% * 38 % 4 

VY, ~ 3% 

2/1 % 3 *% 6 % 10 

~ L 6% 

3 | 1 Ys 4 10 20 
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using the partial products of 


yielding the interpolated row 


1 1x : x z 1x 2 x : x : x g 
2 3 2 =A 3°05 
These quantities grow at a slowing rate that Wallis analyzed in the fol- 
lowing way. Label the quantities in the sequence aj, a2, a3,..., and note 
mat a a a a a 
SS 2S Snes and os as 
ay a3 a5 2 4 ag 
Wallis assumed that, in like manner, 
OS ES aa, 
ay, ag a3 
and considered each inequality in turn. 
Starting with 
1x3 
se > a or > ce ) 
ay ag 1 

Wallis obtained 

_ 3 

5° 
Then es r 
OB rg PAN oe RICE x 4/3 
a2 a3 1x 3/2 

yields 

3 3 

<a 4 A2 

5 mi (A.2) 

The next such result is 
& 3 ‘ 3 . 5 
2 #4 4 


If we write (A.2) as 


eee 
2 #4 3) 
then we can merge the last two inequalities to see that 


ree oe ZA x : 
4 4 2 =#A 3° 


2 
2 
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Continuing in this vein, we obtain 


a a ee is Bs, OB 5 P 
x x x x < < x x x x : 
2° A a G 5 2° 4° 4° 6 4 


As we carry on, the two square roots approach 1, forcing the value 


ever closer to 
8.3 BOF 


xX —X— xX =X = xX 

2 4 4 6 6 8 

Because 1/O = 7/4, we have demonstrated the validity of (2.11), named 
after the thinker who so creatively discovered it. 


> re 


APPENDIX B 


Newton’s original 
generalized binomial 
theorem 


Section 2.7 presented the generalized binomial theorem without any hint 
of its origins. This appendix provides that story, which illustrates the 
fearless persistence with which mathematicians like Isaac Newton ap- 
plied themselves to unsolved problems. 

Table B.1 reorganizes John Wallis’s Table A.1 from Appendix A. 
Each entry in the table (to the right of the dashed vertical line) is the 
coefficient of the x” term in the expansion of the binomial expression 
(1 + 2)*. Newton extended the table’s columns to the left of the dashed 
line, allowing & to take negative values. Because each entry is the sum 


Table B.1: Each entry is the coefficient of the x” term upon expanding (1+2)*. 


k 

= en ee 1 2 3 4 

0 1 ii a 1 1 1 1 

||| <A i? ay 1 2 3 4 

m 2 3 tee - 6 0 1 3 6 
3; -4 -1!' 9 0 0 1 4 

4 5 1! 0 0 0 0 1 

5] -6 -1! 0 0 0 0 0 
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of the one directly to its left and the one above that, Newton completed 
columns & = —1 and k = —2 as shown. As a result, he conjectured that 
one could expand binomials like 


(l4+a)-* =1-a¢+2?—23+a*-—-- and 


(1+2)~? =1—22 + 32? — 42° + 5at —--. 


that have negative exponents. 

Now while Wallis interpolated the sequences that appear as rows to 
the right of the dashed line in Table B.1, introducing columns for k = 
1/2,3/2,5/2,..., Newton discovered a pattern in the entries of the table 
that allowed him to create a column for any fractional /. Any collection 
of columns in the first two rows of Table B.1 may be represented by the 
template 


1 1 1 1 1 1 
a-2 a-l a atl a+2 a+3 


(where the entry a need not be aligned with any particular entry in the 
second row of the table.) Newton hoped to interpolate the second row. 
Knowing that its entries would no longer differ by 1, he altered the tem- 
plate to read 


b b b b b b 
a—2b a—b a a+b a+2b a+3b 


to preserve the additive pattern (namely, that each entry is the sum of 
the one directly to its left and the one above that) while allowing the 
difference between the entries in the second row to be a constant b other 
than 1. Then, extending to a third row, Newton wrote 


b b b b b b 
a— 2b a—b a at+b a+ 2b a+ 3b 
—2a+3b+c -a+b+c ec atc 2a+b+c 3a4+3b+€¢€ 


using a new constant c and the additive pattern to fill in the row. This 
process may generate further rows, but we will stop at row three and 
show how this tableau helped Newton interpolate the rows of Table B.1. 

The columns of Table B.1 give the coefficients in the expansion of 
(1+x)*, where k is an integer. Newton aimed to allow k to be fractional. 
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Table B.2: The coefficients in the expansion of (1 + x)!/? 


spots marked 


will appear in the 


k 
Vr Ww 0 hw 1 hw 2 Hh 8 
—2 —1 
0 1 1 1 1 1 1 
m 1 0 1 2 3 
—2 —1 
2 3 fo) 1 ° 0 0 ) 1 ° 3 
3 0 0 0 1 
—4 —1 
4| 5 1 0 0 0 0 
5 0 0 0 0 
—6 -1 


Let’s follow his approach as we expand (1+ <2) 1/2. First, insert a column 
between each column of Table B.1 as in Table B.2. The numbers that 
belong where the LI symbols appear are the desired coefficients. 


Let’s target the LJ in the row for m = 2, where the symbol o marks 
the other missing coefficients for other values of k. Overlay row 3 of 
Newton’s tableau onto row m = 2 of Table B.2 so that a + c aligns with 
the 0 in column k = 0. From the matching 


2a+3b+c¢ at+b+e c ate 2at+b+ec 38a+3b+c 
° 1 ° 0 0 


that results, we obtain the equations 


-a+ b+c=1 
a +c=0 
3a+ 3b+c=0 


with solution a = —3/8,b = 1/4,c = 3/8. Thus, 


=2a+b+c=-1/8 


is the coefficient of «7 in the expansion of (1 + «)!/?. 
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Calculating the coefficient of «? in the expansion of (1 + x)!/? 


quires that we expand Newton’s tableau by a row and a column, and 
overlay the new row onto row m = 3 of Table B.2. If we align the new 
variable d directly under c in the tableau, then we can obtain the resulting 
system of equations 


re- 


a- b-— c+td=-l 


ct+td= 0 


3a+ 6b+3c+d= 0 


6a — 10b-— 38c+d=-—4 


with solution a = —1/4,b = 1/8,c = 5/16, d = —5/16. Thus, 


=a+2c+d=1/16 


is the coefficient of «? in the expansion of (1 + x)!/? 


the expansion 


. Newton produced 


(1+ a)v?=1+ 5 “0 b Tg™ — gag? tage 0 BD 
by repeatedly expanding his tableau and solving systems of equations 
just as we have done. 

To expand (1 + x)!/3, we must insert two columns between each 
column of Table B.1 before we overlay Newton’s tableau. Doing this 
yields the expansion 


1 1 5 10 22 
i 1/344 24 3 4 5 
(l+a)" = 3° x a2 a3 x 


after quite a bit of work. 

Newton’s method provokes some questions. Can we insert as many 
blank columns as we like? Does it matter how we overlay the pattern onto 
the table? Can’t we include infinitely many equations in the systems we 
solve? Perhaps because of these questions, Newton checked his results. 
An algebraic check of (B.1) involves squaring both sides to see if the 
infinite expression on the right squares to (1 + 2), and indeed it appears 
to do just that. 

Newton’s technique, while both imaginative and effective, requires 
a good deal of algebra. He sought and found a shortcut, the very same 
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we use today. Studying the coefficients in expansions like (B.1), Newton 
realized that multiplying together the first m + 1 factors of 


kK k-1 k-2 k-38 
x —>— xX 


produces the coefficient of x” in the expansion of (1 + x)*. This much- 
streamlined technique lies at the heart of what we call the generalized 
binomial theorem. A complete proof that properly addresses the skepti- 
cisms (listed in the previous paragraph) took over 100 years to emerge. 
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Extra help 


[1] Because the pentagon is regular, we know that isosceles AAED is con- 
gruent to isosceles ABAE. Thus all of ZABE, ZAEB, ZEAD, and ZEDA 
are equal. Let a be the common measure of these angles. Now ZEMD = 2a 
because, as an exterior angle of AAM E, it equals the sum of the two opposite 
interior angles. If we can show that MMBC = 2a as well, then we will know 
that AD is parallel to BC. 


The interior angles of a regular pentagon are all 108°, so 7M ED = 108° — 
a. Summing the angles of AM ED, we obtain 2a + a + 108° — a = 180°, 
soa = 36°. Thus ZMED = 108° — 36° = 72°, which equals 2a, making 
AM ED isosceles (so that MD = 1) and 2M BC = 72° just like ZE MD. So 
AD is parallel to BC as desired. 


[2] To solve 
AD(AD—1)=1 


for AD, we may let x = AD and write 
a(e-1)=1 or 2? -2=1. 
We complete the square by adding 1/4 to both sides, obtaining 
a” —x+1/4=5/4, which factorsas (a — 1/2)? =5/4. 


Taking the square root of both sides and solving for x leads us to the desired 
answer. 


[3] We must first establish that point G in Figure 1.6 does in fact lie on diagonal 
BD. Place the rectangle ABC'D on coordinate axes so that B lies at the origin, 
and BC lies along the horizontal axis. The slope of line BD equals 


CD 1 


BC 
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and the slope of line BG equals 

FG _o-l 

BF 1 
These slopes are equal by (1.2), so G lies on BD. Now we have shown ear- 
lier that rectangles ABCD and FCDE are similar, so the constructions within 
F CDE are all proportional to those within ABC'D. Repeating the argument 


above (forever) leads to the conclusion that w lies on the intersection of BD and 
CE. 


[4] The first identity » = y gives us a starting point, and the second identity 
vy”? = y + 1 comes from (1.4). Adding these equations yields 


pty? =2%+1, 
whereupon we use (1.7) to write the left side as ¢*. Now we may in like manner 


add y? = y + land y® = 2y + 1 to reach y* = 3y +4 2, and continue as long 
as we wish. 


[5] Some additional details that pertain to (1.9): 


By - ,[Ge+2DB-9) 
(e+ 23-9) 


_ /79 +6 — 39? 
pre 

_ /%e+6-3(y4+1) 
y+6—-(e+1) 


— /4p+3 
oo ieee : 


[6] Multiply inside the radical by 1 (in the form of (1/4) x 4) to obtain 


AM = \/(1/4) x 4(y? — (1/2)?) = (1/4) (49? — 1), 
and then bring 1/4 outside the radical as 1/2. 


(because y? = y + 1) 


[7] The proper way to write 


Soo = 1+ (2) 
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is 
1 
lim S, = lim 1 + — (3) 


iL 
141 
where there are n plus signs. Now we can apply basic properties of limits to move 
the limit inside the continued fraction. These properties allow us to write 


mae a 


lim S, =14 (4) 


noo 


tied 
where there are n — 1 plus signs in the limit on the right. That limit equals the 
one on the right side of (3), so if we let 


we may write (4) as 
z=1+-. (5) 
£ 
Looking back at (1.5), we see that x is the golden ratio y. Thus, we conclude not 


only that ¢ is the limiting ratio of the ratio of consecutive Fibonacci numbers, 
but also that 


poe (6) 


is a continued fraction for y. 
[8] We are assuming that » = m/n, so from 
: m 
1l<y<2 wecanwrite 1<— <2. 
n 
Multiplying through by n and then subtracting n from each of the three expres- 


sions yields 
O0<m-n<n. 


[9] The statement “If ¢ is rational, then ¥/5 is rational,” is called an implication, 
and takes the form “If P, then Q.” Such a claim is false only when P is true but 
Q is false. To see this, consider the implication “If your final grade is 90-100, 
then you get an A.” One of four outcomes is possible: 


Your final grade is 90-100 and you get an A. 
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Your final grade is below 90 and you do not get an A. 
Your final grade is below 90 and you get an A. 
Your final grade is 90-100 and you do not get an A. 


No one would challenge the implication if either of the first two outcomes occur. 
Nor should we claim that the implication is false if the third outcome should 
happen, although we might be a bit bewildered by the generosity. Only the final 
outcome shows the implication to be a lie, and this is where P is true but Q is 
false. 


Now we know by the equation 2p—1 = V5 that if y is rational (call this P), 
then \/5 is rational (call this Q). This implication is a true one, so we cannot have 
P true when Q is false. All we must do, then, is prove that \/5 is not rational, 
making @ false, and we will therefore know that P must also be false. 


[10] Substitute p = q+ r into p/q = q/r to obtain 


ee ee or g—rq=r’. 
q r 
We may use the quadratic formula to solve for qg (using the coefficients a = 1 
and 6 = —r and c = —r?) or, as we do here, we may complete the square. Add 


r? /4 to each side to reach 


and then factor the left while adding the right to see that 
( ‘) ound 5r? 
oo): ee? 


Finding the square root of each side and simplifying yields 


Ei AONE 14V5 
an aaa, i? 


Because q is a positive length, we may discard the negative solution and conclude 
that q = yr. 


Now from the original proportion we get pr = q? and with q = yr we find 
that p = gq. 


[11] In showing that an angle inscribed in a semicircle must be a right angle, we 
will make an assumption about parallel lines: if lines AB and DE are parallel (as 
in Figure 1), and a line AC cuts them, then angles CAQ and ACD are equal. In 
the same way, angles CBQ and BCE are equal. This intuitive assumption can 
be proved, but we will not do so here. 
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Figure 1 


As in the figure, let Q be the center of a semicircle with diameter AB and 
choose a point C' on the semicircle. Construct sides AC’ and BC of AABC 
along with radius QC’. Draw DE parallel to AB through C’. We wish to show 
that ZAC'B is a right angle. We are assuming (as described above) that 


ZCAQ = ZACD and 
ZCBQ = ZBCE. (7) 


Now 
ZACD + ZACB + ZBCE = 180° 


because these angles lie on a straight line, and therefore 


ZCAQ + ZACB + ZCBQ = 180° (8) 


when we make the replacements indicated in (7). As an aside, (8) indicates that 
the angles in a triangle sum to 180°. 


Because both AAQC and ABQC are isosceles, we know that 
ZCAQ = ZACQ and 
ZCBQ = ZBCQ. 
Substituting these results into (8) yields 
ZACQ + ZACB + ZBCQ = 180°. 


But ZACB clearly equals the sum of the other two on the left side of this equa- 
tion, so we may write 
2ZACB = 180°, 


from which the desired conclusion immediately follows. 
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[12] Here we quickly deal with the claim that the arithmetic geometric mean 
inequality 


1 n 
7 car Faz te + an) 2 (ayaz +++ an)" 


achieves equality when a; = a2 = --- = Gn. Let a denote the common value of 
the n numbers. Then the right side of the inequality is 


(aa---a)™ or (a")'/™ 


because there are n copies of a in the parentheses. This expression simplifies to 
a, as does the left side 


1 1 
7atat +a) or aC) 
when there are n copies of a. 


[13] We reach (2.2) from 


(QM)? + (or =1? and QM+MC=1 


by solving the first equation for (QM)?: 


2 a /1 1 
(QM) 1 Z 44 ay=sve a?, 


and then using the second equation to replace QM: 


(1— MC)? = 5vi ar. 


Finally, take the square root of each side, and solve for MC’. 


[14] The Pythagorean theorem applied to AAMC yields 


P= (2)" + (ucy, 


whereupon we may use (2.2) to replace M/C, then simplify: 
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[15] We would use (2.1) with n = 12 to find the area of a 24-gon. Because 
a is the length of the side of an n-gon, we would use (2.4) and relabel b as 


a= V2— V3. Thus 
area of a 24-gon = < 


_ ( ae 3.10583. 


The 24-gon approximates 7 ~ 3.14159 more closely than does a 12-gon, which 
has area 3. If we sought an even closer approximation to 7 by finding the area of 
a 48-gon, we would first use (2.4) to find that 


bP =2-V/4-a? 


aealopvs 


is the length of one side of a 24-gon. This is the value we would use for a when 
we looped back to (2.1) to find the area of a 48-gon. 


or 


a 


: 


[16] To establish (2.5), let 


Sx=ltatertte® t-te 


and multiply both sides by (1 — x) to obtain 


(1—2)Sn =(1-a2)(l+a4+a?+a°4 + x") 
=l-grta—a tae. e+e" — gt 
=|—-7""', 


Dividing by (1 — «) and taking the limit yields 


lim S, = lim +—* 
noo n—oo —-2 1-2 


so long as x € (—1,1). 
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[17] We have constructed the angle a so that tana = 1/5, so by (2.9) we see 
that 


tana + tana 
1—tanatana 
1/54+ 1/5 

1— (175)? 

_ 2/5 

ag 24/25 

Be 

120 


The second application of the identity follows similarly: 


tan(a+a) = 


tan 2a + tan 2a 
1 — 2tan2qa-2tan2a 


tan (2a + 2a) = 
_ 5/12 +5/12 
~ 1— (5/1272 


_ 120 
~ 119° 


[18] Because tan 7/4 = 1 we can write 


120 1+ tan(ZBQE) 
119 +1-—tan(ZBQE) 


and then cross-multiply to reach 
120(1 — tan (ZBQE)) = 119(1 + tan (ZBQE)). 
Distributing and rearranging terms brings us to 
239 tan (ZBQE) = 1, 
from which the result tan (ZBQE) = 1/239 follows. 


[19] The integral 
[ a 2?) /2 de 
gives us the area of a quarter of : unit circle because the curve 
y =(1—27)'/? canbewritten x2? +y? =1, 


which is the equation of a unit circle, and then integrating from « = 0 tox = 1 
finds the area in just the first quadrant. We would then evaluate the integral via 
the substitution « = sin a and the identity 


2 cos (2a) +1 
Con SS 
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[20] On the interval x € [0, 7/2], we know that 
O0<sing <1, 
and multiplying through by sin x gives us 
O< sin” x < sing. 
These two double inequalities combine to give us 
sin? x <sing <1. 


Now we multiply through by sin?” x to reach the desired double inequality 


2n+1 2n 


Qn+2 : 
not a< sin’” a. 


si x < sin 
At no point do we change the directions of the inequalities, because we never 
multiplied by a negative number. 


[21] For example, if n = 1 then we are flipping the coin two times, looking 
for the probability that we will flip one head and one tail. The 2? = 4 possible 
outcomes are 


AH, HAT,TH,TT 
and two of the four are the desired outcomes. So the probability P, of getting 
one heads and one tails in two flips is 


Peete 


Now if n = 2 so that we are flipping four times, then direct counting of the de- 
sired outcomes becomes more tedious, and we look for a more robust approach. 
We wish to count the number of ways that we can flip two heads and two tails. If 
we choose two of the four flips to be heads, then the remaining two flips must be 
tails, so all that we must do is count the number of ways that we can assign two 
of the four flips to be heads. The binomial coefficient 


4 4! 
(2) plat 


does the job. By the same argument, the number of ways that we can assign n of 


2n flips to be heads is 
2n\ _ (2n)! 
nn} ontnt~ 


ni2” =2x4x6x---x2n 


[22] The claim 
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merely reflects a shuffling of the factors on the left. For example, when n = 3 
we have 


312% = (1-2-3). (2-2-2) =(1-2)-(2-2)- (3-2) =2-4-6. 


[23] We reach (2.18) from 


Lge ee ee: ee ee Qn _2n 
y eis Whee gen ers eS) In-1 °° A+ 


by ignoring the 1 in the first fraction and factoring (2n+1) from the denominator, 
allowing us to write the expression in the form 


2x2x4x4x---x 2nx 2n 1 
3xX3x5x5x---x (2n—1)x (Qn—1) 2n417 


We may express this as 


2x4x---x (Qn) 7? 1 
3xX5x-+++x (Qn—1)] 2n+10 


[24] Substituting 7/2 for x in (2.20) yields 


=2=(-)() (DODD 


or, using sin 7/2 = 1 and then inverting both sides, 


T Big Bg ele Oo O 
2 1 3 3 5 5 7 


[25] Make the substitution 


1 
(—1)(%)(—2m)(2m)(—3m) (37) --- 
in (2.21) and divide by x to reach 


A= 


sing _ (a — m)(x + 1)(a — 277) (x + 277) (a — 37) (a + 30) --- 
z (<1) ()(—2m) (On)(—Bn) (Bm) -— 


Pairing factors from numerator to denominator yields 


sin x ea-nm e2+n e2-2n 24+2n 4-—3n 4430 


x —1 T —20 20 —37 30 


and then, dividing through each fraction by its denominator, we reach 


= (1-2) (42) (-8) (HB) OB) 
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[26] When n = 0, the result is clear. When n = 1, the right side of (2.24) equals 
1 1 m+2—(m-+1) 


m+1 m+4+2 (m+ 1)(m + 2) 


- 1 
(m+ 1)(m+4 2) ° 


And when n = 2, we get 


1 2 4 2 
m+l m+2- %m-+3)’ 


which upon finding a least common denominator is equivalent to 
(m + 2)(m + 3) — 2(m + 1)(m + 3) + (m+ 1)(m + 2) 
(m + 1)(m + 2)(m + 3) : 
The numerator simplifies to 2 once we expand and collect like terms. In the text, 


we write the numerator as 1-2 because of the factorial pattern that appears across 
the numerators. 


[27] Euler provided the following derivation of the series for y = arcsin x. The 


differential is 
dx 


V1 — x? 


and applying the generalized binomial theorem (2.23), Euler reached 


dy = = (1—27)"\/? dz, 


me —1)(-3 
ay = (14+ S2-at) 4 SPE ae 
—1)(_3)y(_ 8 j 
+! nt i 2) av’)? +--+.) dx 
x (tebe tebe) a 


Integrating yields 


Seu Ul eas, 1-3 zo 1-3-5 7 
v 26° BAe 
and because y = arcsin x, we are finished. 


[28] We can illustrate the two implications by example. When n = 2, we have 


f(x) = x e qx) 


x? (p” — 2pqx + q°2”) 
2! 
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So the terms range in degree from n = 2 to 2n = 4 as promised. Now 
f(z) = px — 3pgx” +2q’2° and 
f" (x) =p? — 6pga + 69° 2”. 


The text claims that the first » — 1 derivatives will equal zero when evaluated at 
x = 0, and we see that this is true here because every term in f’(x) contains a 
power of x. Second, the text claims that the denominator 2! in f(a) will be fully 
canceled in f’’(2), which we see is indeed the case. 


[29] For 


we have 


((p/q) - 2)" (p- q((p/q) — x))" 


f((p/q) — 2) = 


ni 
_ ((1/a)(p — ax)" (ae)” 
n! 
(p — qar)"x”" 


n! 


[30] To show that p” /n! (which is positive) can be made as small as we wish (in 
particular, for our purposes, we wish it to be less than 1), let’s write 


p pe. P . Pp ae Se _P 
nl pl! pt+l p+2 n- 


Each fraction on the right (after the first two) is smaller than p/(p + 1), so we 
may write 


and then choose n large enough to force the expression on the right below 1. 


[31] First, note that no matter how we split the number 10 into n parts, the 
arithmetic mean of the parts is constant. To see this, split a positive number 
into n parts, as in 

a1 +a2+-+:+an = M, 


and multiply by 1/n to get 


1 M 
—(a1 +a2+-+--+4n)=—. 
n n 
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On the left we have the arithmetic mean of the n parts, while the right is a constant 
that is independent of how we chose those parts. 


Now the arithmetic geometric mean inequality states that 


1 n 
7 + a2 +++++4n) > (a1a2+++an)'/ 


with equality precisely when a1 = a2 = --- = Gn, So if (as we just observed) 
the left side is constant, then the product on the right will be maximized when 


the inequality is an equality. This happens when all of the parts are equal. 


[32] Rearrange the factors in 


(1+ 7)" a14%. 248.2 12 ~n n-l n—-2 « 
‘on ln 1 2 n2' 1 2 3 on 
to reach 
r\" n 2 n n—-1 2? n n-1l n—-2 2 
(1+ ) =l1l+-. 1 : ‘ ania : : Aha 
n n lton n 2! n n n 3! 


Divide each term in each fraction (that contains n) by n to obtain 


x\7 x ,1—-(/n) « | 1—(/n) 1—-(2/n) « 
ee) ee i | 1 3! 


As n approaches infinity, all terms on the right in parentheses vanish. 


[33] We will show that 


a ae: 
oa e 
by first proving that 
—Inz= z : 
dx x 
The limit definition of the derivative 
d : x+h)— f(a 


see h 


and we may combine the logarithms in the numerator to write 


Lae eae aa eal 
dx n>0|h x ( 
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For a strictly strategic purpose, we choose to write 1/h as (1/x)(x/h), and so 


d Inz = lim (4:Z)m(+4)]. 
x noo|\a Ah x 


The fraction 1/x does not change as h approaches zero, so it may be brought 
outside the limit as a constant, and its partner x/h pulls up into the exponent; 
hence, 


The natural logarithm function is one-to-one and continuous, so we may ex- 
change it with the limit to reach 


a/h 
Oa Cites lim tee : (9) 
dx x h—0 x 


We will recognize the limit on the right if we letn = 2/h, so that n > oo as 


h — O and the limit becomes 
1 n 
lim (1 + x) : 
n—oo vax 


Section 3.4 offers a proof (based on the arithmetic geometric mean inequality) 
that this limit equals e. Thus from (9) we see that 


dx 


With this established, let y = e” so that x = In y and use implicit differentiation 
(with respect to x) on the latter to get 


1 dy dy 
1=-.- = 
y dx Ode 
and because y = e”, we have our desired result 
e =e 
dr 


[34] We are proving Bernoulli’s inequality: 


Claim. If m is a natural number and y > —1, then 


(1+y)" >1+my. 


Our induction proof begins by showing that the claim holds form = 1 (and 
we did this in the section itself). Now we assume that the claim is true for a fixed 
natural number m and show that via this assumption we can reach the next case: 
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To show: If m + 1 is a natural number and y > —1, then 


G+g)"C S14 (mt iy 


Starting with the left side, we may write 


(lt+yy™" =(1ty)™(L+y) 


IV 


(1+ my)(1+ y) 


by first factoring and then by appealing to our assumption that the claim holds for 
m. It is here that we require (1+ y) > 0 (so y > —1) in order that the inequality 
be true. Now we may expand the last expression to see that 


(l+y)™** > 1+ (m+ ljy+my? 


= 1+ (m Tv 1)y, 
which is what we set out to show. 


[35] From x > 0 we know that x/e > 0, and then subtracting 1 gives us 


L—e 


Ea: lee saa: 
e e 
[36] We may write 
SGV PS! gg. ghlegel SSeS e oy 
e e 
and then multiply through by e to reach 
e"/° > x. 


Taking the natural logarithm of both sides, we obtain 


Because x > 0, we do not need to reverse the inequality in this last step. Now 
exponentiating both sides leads to the desired result. 


[37] [recommend that you use technology to take a look at the first few members 
of the family of curves 
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namely, 
1 0-2 x 
Yo = 0! ve =e, 
1 1 -—a —x 
y= Tr cre =aze ’, 
oy AM yee. 3 xe * 
Ge Ore a? 


and so on. Seeing the family of curves may help you follow the forthcoming 
argument. 


[38] Recall that we are only interested in determining if the derivative 


dx 


equals zero on the interval x € (0,1). The first factor x”~' can only equal zero if 
x = 0, and the second factor e~® can never equal zero. The third factor (n — x) is 
always positive because n is a natural number. Because none of the three factors 
can equal zero on the interval x € (0,1), we conclude that the curve has no 
critical points (and therefore no maximums) on that interval. 


1 
ai =[ xe "dx 
0 


using integration by parts with 


[39] We calculate 


U=x so du= dz, 
dvu=e "dx so v =-e”, 
so 
‘ 1 1 1 
a= [-se~*], +f e dx = —— +f e "daz. 
0 € 0 


[40] For example, suppose that e = 19/7. Then we would split series (3.17) at 
the 1/7! term: 


eee oe re a epre es 
hea far er gi 8 


In preparation for what follows in the argument, note that multiplying each term 
by 7! turns each term in the first group to an integer. 


Extra help 165 


[41] The geometric series 


Vin tect te oan 
gah Kaede saat Ly 


has leading term a = 1/(q + 1) and common ratio r = 1/(q + 1). Thus the 
series sums to 


[42] To see why 
/ Ina dx ¥ Inl+1n2+---4+Inn, 
1 


redraw Figure 3.1 but use rectangles in place of the trapezoids. 


[43] Exponentiating 
L-1xnlnn-—n-Inn! + (1/2)Inn 


yields 


ee} ~ e@ Inn—n-Inn!+(1/2) nn 


and the right side can be written 


(enr)rem nyt? n/n 


er elnn! nlier ~ 


Thus, 


[44] The expression 


1 n 
Vr(n+ 1/2) Van 


n tl n? 
/2rn(n + 1/2) Qnn2 + 7n’ 


and then we can divide through each term by n? to reach 


et 
Qr+n/n- 


Letting n — co gives us 1/27. 


can be written 
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[45] The step 


lim Sen = lim Sy 
noo noo 


is justified by a theorem from the theory of limits: If a sequence converges to a 
limit, then every subsequence converges to the same limit. The sequence {S2,,} 
simply picks out the even-numbered entries of {.S;,} and is therefore a subse- 
quence. 


[46] From (3.24) we may write 


Von 


and, taking the natural logarithm of each side, we find that 


L-1=In(v2n)™* or L=1—In(V2rz). 


[47] Let Q denote the infinite tail end of the right side of 


1 1 
1 : 14 : 
Ate 5 
1+ 
3 
2+ 
3+-: 
so that we have 
1 és Q 2 
= where = 
14 : 14 . 2+ : 
T T 
e-1 14+Q Say. 
Rewriting this expression as 
1 = 1 
1+ : 14 : 
e—1 ' (24+Q)-1 


shows that e = 2 + Q, which yields the desired result. 
[48] Expanding 
(u+v)? =6(u+v) +10 


yields 
uw? + 3u2u + 3uv? + v? = bu + 6u + 10. 


Subtracting 6u and 6v gives us 


u? +u° + 3u7u — 6u + 3uv” — 6v = 10 
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or 
u? +0? + (3uv — 6)u + (3uv — 6)u = 10, 


and factoring (3uv — 6) from the appropriate terms gives us the desired result. 


[49] Substituting the first of the two solutions u = (5+ V/17)!/° into u? +03 = 
10 gives us 


(5 ms vi7)"*) +v2=10 or v?=10-(54+ViI7), 
from which we conclude that 
v =(5-V17)'”. 


In the same way, starting with the second solution u = (5 — Vv 17)'/ 3 leads to 
v= (54+ f1T\ >. So because x = u + v, we only get one solution for x, 


namely 
x =(54+V17)'? + (5-Vv17)"”?. 


[50] As an example of depressing a cubic, let’s rid the polynomial 
de® + 8a? +242 


of its x? term by first dividing by the leading coefficient 4 to obtain 


‘ 1 1 
3 2 

4+ 2¢°+—-a“24+= 

x zc x 5 


and then identifying from this that b = 2. So to shift the curve appropriately, we 
substitute « — 2/3 for x and get 


eae ine x 2 ae c z ne or gu ed 
3 3 4 3 2 12 27 
as desired. Now we can use del Ferro’s technique to find the zeros of this shifted 


cubic polynomial, and shift them by 2/3 to get the zeros of the original cubic 
polynomial. 


[51] Multiply both sides of 
a® — 2q 


3p > 
by a (thus reversing the inequality because a < 0) to get 
sap < a — 2q. 
2 


From p = a” we have ap = a? and so 


3ap <ap—2q or 2ap < —2¢. 
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Dividing by —2 yields 
—ap>q 


and we square both sides to reach 
ap’ > ¢q. 
Using p = a? again, we soon reach the desired result 
¢ = p <0. 
[52] My thanks to Chenchen Zhao and Ben Oltsik for showing me this alternative 
solution. In it, we will use the customary symbol 7 to represent ./—1. We wish to 


simplify 
(2421)? + 2-21", 


Let 
x = (24+2i)'/%, soa? =2+42i, and 
y = (2—2i)'/%, soy? = 2— 2i. 


The question becomes: what is x + y? This sum appears as a factor on the right 


of the identity 


> _ ay ty’). (10) 


e+y=(r+y)(e 
On the left, 


ety =24+2%4+2-%=4, 
while on the right, 


ay = (2 + 24)'/3(2 — 24/3 = ((2 + 24)(2 — 28))'”* = (4 — (-4)) 73 =2, 


and 


x? = ((2+28)?)'/? = (44 81 —4)173 = 21/8, 


y? = ((2—2%)?)'? = (4-84 — 4)1/8 = —21/8, 


Thus, from (10) we have 

4= (a+ y)(2i'/? — 2 — 231/), 
or x + y = —2 as Bombelli found. 
[53] From Figure 4.4, we have 


cos 8 = or a=rcos6, 


sin? = or b=rsin8, 


Soros: 
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so 
a+bi=rcos@+rsin@-i=r(cosé+isin@). 


[54] Before we prove anything about how to perform multiplication in the com- 
plex plane, we will observe what happens for an example. As a vector in the 
complex plane, the number 1 + 7 has magnitude 


t+iy=JVP+P=V2 


and creates angle 7/4 with the horizontal axis. When we square 1 + 7 we get 


(l4+)14+i =14+2i-1=21. 


The complex number 27 has magnitude 2 and angle 7/2. It is no coincidence that 


jl +4]. |1 +2] = V2- V2 =2 = |2i|, and 


L£1+) +2404) =F4+9= 5 =4(2). 


Let’s prove that we add the angles of two complex numbers when we find their 
product. The angle 6 in Figure 4.4 can be expressed 


0 = arctan b/a. 


Suppose we have a second complex number c+di with its angle a = arctan d/c. 
Then 


6+ a = arctan b/a + arctan d/c 


ad + bc 
= arct ———— 11 
arctan (22°C), (11) 
using the identity 
arctana + arctan 6 = arctan ae? 
1-—aBp 
for the last step. Now 
(a + bi)(c + di) = (ac — bd) + (ad + be)i (12) 
makes the angle 
arctan asia 
ac — bd 


with the horizontal axis, and this matches with (11). Thus, when we multiply two 
complex numbers, we add the angles of their corresponding vectors. 


Now why do we multiply the lengths of a + bz and c+ di when we find their 
product? From Figure 4.4 and the Pythagorean theorem, we see that 


la + bil -|e+ di] = (a? +B)7°(P +)? (13) 
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and from (12) we have 


1/2 


\(a + bi)(c + di)| = [(ac bd)? + (ad 4 be)? | 


= [(ac)? + (bd)? + (ad)? + (be)?]*/” 


= [(a? +0)(2 +4)”, 


from which we can reach (13). So we multiply the magnitudes of two complex 
numbers when we find their product, rather than add as we do with the angles. 


[55] From cos (7/6) = 3/2 and the identity 


ee A\_ /1+cosA 
2 2 
we know that 


Yoox (2) = v3.4] + v2 ane Sn 2N8 
2 2 2 ‘ 


(14) 
Now (as an aside) we note that 
(1+ V3)(1+ V3) =44+2V3 so 14+V3=1/44 2v3, 


which lets us write the last expression in (14) as 


1+¥V3 
2 


as desired. The result for /2 sin (1/12) follows similarly. 


Historical note. An expression like 


4+2V3 


is called a nested radical, and writing it in a simpler form is possible only in 
certain circumstances. Even then, the process is a bit tricky. Because nested rad- 
icals appear in a variety of problems of historical importance (such as Liu Hui’s 
approximation of 7 in section 2.1, or the solution (4.5) to the depressed cubic), 
simplifying nested radicals has attracted attention for centuries. Progress contin- 
ues to this day; for example, in 1992, Susan Landau (United States, born 1954) 
discovered the first algorithm that can simplify a general nested radical. 


[56] We use the half-angle identity 


: A /1—cosA : 17x 
sin ( >) + 5 with A= a 
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and, because A is in the second quadrant, we choose the negative version. So 


__ [1+ v3/2) 
2 


=~ 5y24V5. 


Similarly, using the half-angle identity for cosine, we find that 


177 1 / 
cos (FZ) =-5 2— V3. 


Thus, 


V2 (cos (177/12) + isin (177/12)) 
v2( sy? v3 5 V2+ v3) 


F (v2 V3-+ iy/2+ v3). 


[57] Add the fractions on the right to get 


(A+ Bi)(1 — ix) + (C + Di)(1 + iz) 
(1 + iz)(1 — iz) 


Expanding and collecting like terms in the numerator yields 


(A+ C)+(B+ D)i] + [(B- D)+(C-— A)ilax 
(1 + ix)(1 — iz) 


We need to find A, B, C, D so that the numerator equals 1, or equivalently, 


[1 + 04] + [0 + Oi] x. 


(15) 


(16) 


Matching the coefficients within brackets in (15) and (16), we obtain the system 


of equations 


C-A=0 B-D=0, 
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which has solutions A = C = 1/2 and B= D=0. 


[58] Substituting z = tana and dz = sec? a da in 


sec? ada 


O=% givesus 0=i% 


‘ dz 
Jiez 


and then the identity sec? a = 1 + tan? a leads to 


g=i [ seca da. 


[59] Multiply within the logarithm of 
0=iln(V1—y? — ty) 
by (/1 — y? + iy) over itself to get 


nim (CAF ee) . 
Jl —y? +4y 


The numerator, if expanded, simplifies to 1. Thus we may write 
9 =iln (v1 aye iy] 
= —iln(/1—y? + ity). 


[60] The substitution x = 7/2 in 
e'” = cos@ + isin 


yields 


e'"/2) — cos (m/2) +isin(r/2)=O0+i, so el!) =i. 


Taking the natural logarithm of each side, and then solving for 7, gives us 


21ni 
T= 7 , 
i 


whereas raising each side of (17) to the power 7 leads to 


2 , , 
2 3 : —x/2 
ef 7/2) jt op fh =e T/?. 


[61] Squaring and multiplying by x? converts 


Z = (4943 to y? =27(4y 43), 


V1+tan?a 


(17) 
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and using x”? + y” = 1 to replace y? gives us 
1-2? =2°(49+3) of 1=27(4944) =1=407(p 41). 
From (1.8) we can write 


1=42°¢" or £= — 


as desired. From 


— V4et+3  /4p+3 
20 Ay? 
we may simplify by multiplying both numerator and denominator by 2 — y to 
get 


(494+3)2-y)  /p+2 _ vere 
2 


[62] The half-angle identity for cosine is 


cos (2) = cosa + 1 
QQ) VY 2 , 


so with a = 27/5 we get 


We know that cos (27/5) = 1/(2y) from (4.26), and so, substituting and sim- 
plifying, we find that 


3 
= ,4/— from (1.8) 
? 
= »/2. 
[63] To fill in the p = 1, k = 3 entry of Table A.1, we would find the area under 


the curve y = (1 — )? inside the unit square. As in (A.1), we expand: 


area under (1 — a)* = area under (1 — 32 + 3a” — x”) 
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So we put the reciprocal of 1/4 in the appropriate spot in the table. 


As another example, we fill in the p = 2, k = 2 entry by calculating 


area under (1 — gi/?)? = area under (1 — Qatl? 4 x) 


i 
1 
- 
| 
+ 


The reciprocal 6 therefore belongs in the corresponding spot in the table. 


[64] For example, we use Wallis’s equation 


#dotstotal — 1 ( 
# dots along base 2 


n+1) 


with n = 5/2 (where n is the number of dots along the base) to find that 


# dots total 1 
5/2 2 


€ + 1) andso # dots total = 


oo| & 
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